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Abstract 

Estrogen  Receptor  alpha  (aER),  binds  estradiol,  initiating  a  cascade  of  events  that  leads  to  aER- 
positive  breast  cancer.  The  aER  binds  the  FDA  approved  drug  for  breast  cancer,  tamoxifen,  and  also 
raloxifene,  with  high  affinity.  The  crystal  structures  of  the  aER-estradiol,  aER-tamoxifen,  and  aER- 
raloxifene  complexes  reveal  that  estradiol  binding  changes  the  conformation  of  helix  12  (HI  2)  so  that  co- 
regulatory  proteins  can  bind  and  activate  transcription,  whereas  tamoxifen  and  raloxifene  binding 
displaces  H12  to  a  different  position  that  prevents  the  binding  of  co-regulatory  proteins. 

We  have  successfully  combined  MD  &  QM  methods  in  an  innovative  way  to  develop  a 
computational  assay  that  will  allow  for  the  development  of  novel  lead  compounds  that  should  be  active 
against  breast  cancer.  Our  test  system  consists  of  estradiol,  tamoxifen,  and  several  known  raloxifene 
analogs.  MD  simulation  were  performed  for  3-5  nanoseconds  on  a  truncated  aER  protein  bound  to 
different  ligands.  These  simulations  generated  input  structures  for  a  variety  of  methods,  ranging  from 
MM  to  QM,  for  determining  energy  the  binding  energy  of  ligands  to  the  aER.  We  have  determined  the 
optimal  method  for  correlating  binding  energies  to  experimental  relative  binding  affinitites  (RBAs).  We 
will  use  this  assay  to  design  novel  inhibitors  with  high-predicted  activity. 


Introduction 

The  estrogen  receptor  (ER),  a  member  of  the  steroid  receptor  nuclear  family,  resides  in  the 
nucleus  of  cells  in  its  inactive  form.  In  the  presence  of  estrogen  it  becomes  activated  and  the  resulting 
ER-estradiol  complex  forms  a  stable  dimer  that  subsequently  initiates  the  transcription  of  target  genes.1 
The  ER  has  been  studied  extensively  and  found  to  have  a  significant  role  on  cholesterol  and  lipid  levels, 
the  skeletal  system,  the  central  nervous  system,  as  well  as  reproductive  functions  in  both  males  and 
females.2"4 

The  site  at  which  estrogen  binds  to  the  ER  is  called  the  ligand  binding  domain  (LBD).5  The  LBD 
is  a  small  hydrophobic  pocket  that  can  accommodate  -450  A.3  Estrogen’s  volume  is  only  245  A3,4  which 
means  that  the  relatively  large  LBD  should  be  able  to  accommodate  and  bind  other  hydrophobic  ligands 
of  greater  size,  and  in  fact,  it  does.  When  another  ligand  binds  to  the  ER  and  mimics  estrogen’s 
physiological  affects,  it  is  called  an  agonist,  and  when  it  acts  as  an  inhibitor,  it  is  called  an  antagonist. 

While  estrogen-mediated  transcription  of  genes  is  essential  for  health,  some  aggressive  human 
breast  and  endometrial  cancers  exhibit  this  same  mechanism  for  growth.6  In  fact,  measurable  amounts  of 
ER  can  be  found  in  75%  of  breast  tumors.7  In  light  of  these  findings,  a  great  deal  of  work  has  been  done 
to  characterize  ligands  that  bind  antagonistically  to  the  ER  in  tumor  cells,  while  behaving  as  an  agonist  in 
other  normal  tissues  in  order  to  minimize  the  antagonist  activity.  The  ER  ligands  that  display  this 
selective  binding  profile  are  called  selective  estrogen  receptor  modulators  (SERMs).  SERMs  have  been 
used  to  effectively  treat  breast  cancer  and  other  estrogen-regulated  disease. 

The  most  popular  SERM  used  clinically  is  tamoxifen.  The  drug  itself  has  only  a  mild  affinity  for 
the  aER,  however  it  undergoes  modification  once  ingested,  which  increases  its  affinity  for  aER  by  100- 
fold.  Tamoxifen  encounters  members  of  the  cytochrome  p450  enzyme  (CYPs)  superfamily  in  the  liver  as 
well  as  the  intestine,  where  it  is  ultimately  metabolized  into  two  products,  4-hydroxytamoxifen  and 
endoxifen,  as  seen  in  Figure  1,  both  of  which  are  100  times  better  at  binding  with  aER  than  tamoxifen.8 

It  has  been  shown  to  inhibit  breast  cancer  and  to  have  several  other  health  benefits.910 
Because  tamoxifen  acts  like  estrogen  in  bone,  it  reduces  bone  resorption  and  consequently  leads  to  higher 
bone  density.9  Many  studies  have  shown  that,  like  estrogen,  tamoxifen  and  other  SERMs  have 
neuroprotective  properties.  The  mechanisms  for  this  protection  are  not  understood,  but  further 
investigation  may  illicit  possible  benefits  on  neurodegenerative  diseases  like  Alzheimer’s  and 
Parkinson’s."  However,  recent  studies  have  linked  tamoxifen  to  dramatically  increased  risks  of 
endometrial  cancer.  In  2004,  Curtis  et  al.  provided  population  based  evidence  that  the  use  of  tamoxifen 
increases  a  patient’s  relative  risk  for  uterine  corpus  cancer  more  than  twofold,  and  for  malignant  mixed 
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mullerian  tumors  by  fourfold.12 


For  this  reason,  researchers  are  looking  for  new  drugs  to  treat  breast  cancer.  A  ‘'second 
generation”  SERM  that  has  promising  qualities  is  raloxifene,  seen  in  Figure  2.  It  has  a  similar  affinity  for 
aER,13  but  acts  as  an  antagonist  both  in  breast  and  endometrial  tissues.14  In  addition,  it  shares  many  of 
the  same  health  benefits  of  tamoxifen.1516  It  has  been  shown  to  inhibit  breast  cancer  and  to  have  several 
other  health  benefits.910  During  the  Multiple  Outcomes  of  Raloxifene  Evaluation  (MORE)  trial,  it  was 
found  that  for  women  who  received  raloxifene,  the  risk  of  invasive  breast  cancer  was  reduced  by  76% 
after  3  years  and  72%  after  4  years  compared  to  the  women  who  received  placebo.1718  Because  tamoxifen 
acts  like  estrogen  in  bone,  it  reduces  bone  resorption  and  consequently  leads  to  higher  bone  density.9 
Many  studies  have  shown  that,  like  estrogen,  tamoxifen  and  other  SERMs  have  neuroprotective 
properties.  The  mechanisms  for  this  protection  are  not  understood,  but  further  investigation  may  illicit 
possible  benefits  on  neurodegenerative  diseases  like  Alzheimer’s  and  Parkinson’s."  However,  recent 
studies  have  linked  tamoxifen  to  dramatically  increased  risks  of  endometrial  cancer.  In  2004,  Curtis  et  al. 
provided  population  based  evidence  that  the  use  of  tamoxifen  increases  a  patient’s  relative  risk  for  uterine 
corpus  cancer  more  than  twofold,  and  for  malignant  mixed  mullerian  tumors  by  fourfold.12 
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Estrogen  therapy  in  postmenopausal  women  has  been  associated  with  higher  risk  of  coronary 
heart  disease  (CHD)  as  well  as  fatal  strokes,20  however  raloxifene  has  not  been  associated  with  these 
risks.  After  the  MORE  trial,  a  post  hoc  analysis  was  done  using  data  from  the  participants  that 
determined  that  patients  who  received  raloxifene  were  not  at  an  increased  risk  of  cardiovascular  events 
when  compared  to  those  who  received  placebos.  For  these  patients,  who  were  considered  high-risk  for 
CHD,  taking  raloxifene  actually  reduced  their  risk  of  a  cardiovascular  incident.  Currently,  there  is  an 
international,  randomized,  double-blind,  placebo-controlled  study  of  over  10,000  postmenopausal  women 
with  increased  risk  of  coronary  heart  disease  called  Raloxifene  Use  in  The  Heart  (RUTH)  that  is 
underway  which  will  determine  whether  raloxifene  actually  has  significant  cardiovascular  benefit  on 
patients.21 


While  raloxifene’s  binding  profile  is  highly  desirable  for  cancer  treatment,  it  is  not  an  effective 
anti-cancer  drug  because  only  2%  of  the  dose  becomes  bioavailable.22  Once  it  enters  the  body,  raloxifene 
faces  a  host  of  obstacles.  Sixty  percent  of  the  drug  is  rapidly  absorbed  after  ingestion,23  but  then  it 
undergoes  rapid  presystemic  glucuronidation  both  in  the  liver  and  the  intestine.  The  process  of 
glucuronidation  is  one  of  the  body’s  mechanisms  for  increasing  substrates  water  solubility,  but  it  acts  as  a 
significant  barrier  in  the  way  of  drug  development.  Research  has  shown  that  raloxifene  is  susceptible  to 
this  process  primarily  at  the  4’  position,  but  also  at  the  6  position,  shown  in  Figure  3.  The  UDP- 
glucuronsyltransferases  (UGTs)  in  the  liver,  primarily  UGT1  Al,  catalyzes  glucuronidation  primarily  at 
the  6’  position  while  intestinal  UGTs,  UGT1 A8  and  UGT1 A 10,  make  4’  glucuronides  preferentially.22 
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The  rate  of  clearance  of  metabolism  is  surprisingly  higher  in  humans  than  in  rats,  considering 
present  in  human  intestine,  which  ultimately  explains  why,  in  rats,  there  is  a  39%  bioavailability 
compared  to  the  human  2%.  These  findings  also  support  the  idea  that  intestinal  enzymes  play  a  more 
critical  role  in  the  metabolism  of  raloxifene  than  those  in  the  liver.24  While  glucuronidation  seems  to  be 
the  major  obstacle  in  the  way  of  bioavailability,  it  has  also  been  shown  that  raloxifene  has  difficulties 
with  transport  and  is  effluxed  by  several  transport  proteins,  such  as  multidrug  resistance  related  protein 
(MRP)  and  organic  anion  transporter  (OAT).25 

One  of  the  aspects  of  ER-ligand  interaction  that  is  yet  to  be  fully  understood  is  exactly  how  one 
ligand  can  act  as  an  agonist  in  one  tissue,  and  an  antagonist  in  other  tissues.  The  existence  of  two 
subtypes  of  ER,  aER  and  aERx,  which  reside  in  different  tissues  helps  to  explain  some  of  this 
selectivity,  but  the  interactions  are  far  more  complex.  Each  ligand  that  binds  to  the  ER  induces  a  unique 
conformational  change.  Different  peptide  binding  sites  are  exposed  in  the  different  complexes;  these 
newly  exposed  sites  are  different  from  other  ligand-ER  complexes,  and  different  from  the  native  ER  as 
well.  This  information  suggests  that  each  protein-ligand  complex  is  involved  in  other  protein-protein 
interactions  that  ultimately  affect  whether  the  bound  ligand  acts  as  an  agonist  or  antagonist.  The 
distribution  of  these  binding  cofactors  among  tissues  may  be  the  reason  for  one  ligand’s  variable  binding 
profile.1 

In  1997,  Grese  et  al.13  published  data  on  76  raloxifene  analogs  that  they  synthesized,  including 
experimental  relative  binding  affinities  for  each  analog,  compared  to  estradiol,  as  well  as  IC5o  values.  In 
the  interest  of  creating  a  molecule  that  shares  raloxifene’s  agonist/antagonist  profile,  but  not  its 
susceptibility  to  degradation,  we  have  used  computational  methods  to  study  the  binding  affinity  of 
several  of  the  Grese  raloxifene  derivatives  that  have  been  tested  for  anti-cancer  properties  (Figure  4). 

Each  analog  was  experimentally  tested  in  two  assays,  a  radioligand  competitive  binding  assay  as  well  as 
an  MCF-7  cell  proliferation  assay,  which  is  used  to  determine  the  ligand  concentration  necessary  to 
inhibit  50%  of  MCF-7  breast  cancer  cells  (IC  5o)» "  These  I^35ovalues  tell  us  which  ligands  are  good  cancer 
drugs,  but  the  RBA  values  obtained  from  the  radioligand  binding  assay  will  give  us  a  comparison  point 
for  the  theoretical  binding  energies. 

One  of  the  most  difficult  aspects  of  computational  modeling  of  drug-protein  interactions  is 
incorporating  protein  flexibility.  Many  programs  try  to  model  the  conformational  flexibility  of  the 
ligand,  while  leaving  the  protein  rigid.  However,  there  are  several  ways  to  model  flexibility  in  a  receptor 
molecule.  One  way,  the  “relaxed  complex”  method,  involves  running  a  long  molecular  dynamics  (MD) 
simulation  of  the  uncoupled  receptor  in  order  to  sample  the  protein’s  unique  conformations.  The  second 
phase  of  this  method  then  involves  rapid  docking  of  the  ligands  into  all  of  the  unique  protein  conformers. 
The  complexes  can  then  be  overlaid  to  evaluate  the  binding  pocket  in  several  modes.26 

Studies  have  also  shown  that  for  the  use  of  Quantum  Mechanical/Molecular  Mechanical 
approaches  to  study  protein-substrate  interactions,  protein  conformational  sampling  is  especially 
important  since  small  fluctuations  in  protein  shape  can  have  a  great  effect  on  the  total  energy.  It  has  been 
suggested  that  the  best  approach  with  energy  calculations  would  be  to  average  the  results  of  several 
protein  conformers  obtained  from  a  long  MD  simulation.27 
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The  main  principle  behind  computational  drug  design  is  that  regardless  of  how  specific  a 
particular  receptor  is  for  its  substrate,  we  can  still  create  a  small  molecule  that  has  a  greater  binding 
affinity.  This  kind  of  assumption  has  moved  away  from  the  historic  lock-and-key  and  induced-fit 
models.  Current  understanding  has  shifted  to  the  idea  that  a  protein  exists  in  an  equilibrium  of 
conformational  states.  It  has  been  suggested  in  other  literature  that  in  order  for  a  ligand  to  have  overall 
favorable  binding  affinity,  the  ligand  may  be  a  moderate  binder  for  a  highly  populated  protein  conformer, 
or  an  excellent  binder  for  a  less  populated  conformer.28,29 


Our  goal  in  this  study  was  to  benchmark  a  methodology  for  computational  modeling  of  the  aER- 
ligand  system.  The  analogs  created  by  Grese  et  al.13  were  used  to  explore  the  binding  site  of  aER.  We 
used  AMBER830  to  study  the  molecular  dynamics  of  the  ER  complexes  with  available  crystal  structures 
(ER-estradiol,  ER-tamoxifen,  ER-raloxifene,  shown  in  Figure  5),  as  well  as  aER  with  the  raloxifene 
analogs.  These  simulations  provided  us  with  information  about  the  conformational  flexibility  of  the 
protein,  as  well  as  the  conformational  freedom  of  the  ligand.  The  relative  stability  of  the  protein  in  its 
entirety  encouraged  us  to  focus  our  energies  on  the  active  site  of  the  protein.  While  it  is  feasible  to  run 
MD  simulations  on  the  whole  protein-ligand  complex,  the  calculations  are  very  expensive.  We  created  a 
simpler  model  of  our  protein  by  cutting  away  all  residues  that  were  outside  of  a  10  A  radius  centered  on 
the  ligand,  and  then  capped  the  end  groups  and  held  them  fixed  in  order  to  preserve  the  active  site 
configuration  during  the  MD  simulations.  This  reduced  the  size  of  our  system  from  4000  atoms  down  to 
1400  atoms.  We  report  the  problems  we  encountered  using  both  methods,  and  suggest  areas  that  require 
careful  attention  in  a  drug  design  project  such  as  this  one. 
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Figure  5:  Top,  estradiol  (LEFT),  raloxifene  (purple)  and  tamoxifen  (blue)  overlaid  in  the  binding  cavity  of  cxER. 
Right,  three  crystal  structures  of  ER-estradiol,  ER-raloxifene,  ER-tamoxifen,  overlaid.  In  each  complex,  the 
protein  takes  on  a  slightly  different  conformation. 


Computational  Methodology 

We  compared  the  crystal  structures  of  estradiol  (1ERE),14  tamoxifen  (3ERT)31  and  raloxifene 
(1ERR),14  as  well  as  9  other  resolved  structures  bound  to  the  ER  and  found  that  not  only  was  the  overall 
ER  protein  structure  very  similar,  and  that  the  binding  mode  for  all  three,  in  particular  the  two  SERMs, 
was  nearly  identical,  as  shown  in  Figure  5.  Due  to  the  high  resolution  of  its  structure  (1.9  A),  the  ER 
from  the  tamoxifen-ER  complex  (3ERT)  was  used  as  our  template.  Also,  based  on  crystallographic 
information  of  the  environment  of  the  ligand,  specifically  its  proximity  to  an  aspartic  acid  (ASP351),  we 
decided  to  protonate  the  amine  tail  of  the  ligands  for  our  calculations.  We  ran  MD  simulations  on  each 
ligand  in  two  systems,  the  enter  ER,  and  then  a  smaller  model  of  the  ER  LBD.  To  analyze  the  effect  of 
protonation  on  the  movement  and  interaction  of  the  amine  tail  of  the  ligands,  we  first  used  VMD  to 
closely  examine  the  MD  simulations.  We  were  able  to  select  all  atoms  that  came  within  3  A  of  the 
hydrogen,  and  from  this  perspective  we  could  select  the  atoms  of  residues  that  came  close  enough  to 
hydrogen  bond.  Once  we  identified  the  different  residues,  we  used  Ptraj  to  calculate  the  distance  between 
atoms  for  all  the  possible  contacts  in  each  simulation,  for  both  the  small  system  as  well  as  the  large 
system. 


Large  System 

In  order  to  obtain  a  good  starting  structure  for  atomic  partial  charge  assignment,  tamoxifen  was 
cut  out  of  3ERT  and  underwent  a  HF/6-31G*  geometry  optimization  using  Gaussian0332  and  then  was 
fitted  with  RESP  charges  in  Antechamber.33  Hydrogens  were  added  to  the  ER  of  3ERT  in  the  LEaP 
module  of  AMBER8.0,  and  then  the  entire  complex  was  put  through  a  SANDER  minimization  using  the 
generalized  amber  force  field  (GAFF)34  for  the  ligand  and  ff99SB35  for  the  protein.  The  energy 
minimization  used  a  standard  GB  implicit  solvation  model,  1 .2  A  electrostatic  scaling,  and  2.0  A  van  der 
Waals  scaling.  We  took  the  minimized  complex,  removed  tamoxifen,  and  then  loaded  the  empty  ER  into 
AutoDockTools.36  We  then  used  model  to  dock  raloxifene  and  its  derivatives  in  the  ligand  binding  site. 
Using  the  genetic  algorithms’  default  parameters,  we  docked  each  ligand  200  times.  For  each  derivative, 
we  chose  the  conformer  that  most  closely  matched  raloxifene’s  crystal  structure  (1ERR).  Each  complex 
was  then  minimized  with  SANDER,  in  the  same  way  as  the  original.  We  ran  MD  simulations  on  each 
complex  for  a  duration  of  800-3000  ps.  The  complexes  were  heated  for  50  ps,  from  0  K  to  300  K, 
followed  by  their  production  runs  at  a  constant  pressure.  Using  Ptraj,  in  AMBER,  the  RMS  deviation  of 
the  ligands  during  the  simulations  were  calculated  in  order  to  determine  the  point  at  which  the  ligand 
could  be  considered  equilibrated  (data  not  shown).  Once  that  data  was  obtained,  we  used  the  MMGBSA 
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scoring  function  in  AMBER  to  calculate  changes  in  free  energy  of  binding  for  each  of  the  ligands  during 
their  equilibrated  trajectory. 


Small  System 

We  used  the  program  VMD39  to  view  the  tamoxifen-aER  crystal  structure,  and  to  select  the 
residues  that  were  within  10  A  of  the  ligand  in  the  binding  pocket.  We  removed  all  other  residues,  except 
for  those  that  seemed  relevant  or  were  connecting  the  remaining  residues,  the  remaining  protein  was 
capped  using  acetyl  and  N-methylamine  groups.  This  reduced  the  size  of  the  protein  from  4000  atoms  to 
1400  atoms  (Figure  6).  Terminal  and  “connecting”  residues  of  the  cut  protein  were  restrained.  The  exact 
ligand  pose  from  the  large  protein  simulation  was  used  for  the  cut  protein  system.  This  was 
accomplished  by  direct  transfer  of  coordinates  for  the  ligand  to  the  pdb  file  of  the  smaller  protein.  Each 
complex  was  minimized  using  SANDER,  and  then  put  through  a  heating  and  a  production  MD 
simulation  lasting  2000  ps,  using  the  same  parameters  as  the  large  system,  except  for  the  restraints 
imposed  on  the  selected  residues.  As  with  the  larger  system,  Ptraj  was  used  to  calculate  RMS  for  the 
ligands,  in  order  to  determine  when  the  ligands  were  equilibrated  (data  not  shown).  To  analyze  these 
runs  we  used  both  the  MMGBSA  scoring  function,  as  before,  as  well  as  the  semi-empirical  DivCon38 
single-point  calculations  (SPCs).  We  pulled  out  snapshots  from  the  MD  simulations,  after  the  ligand’s 
trajectory  was  stable,  every  50  ps.  This  structure  was  then  converted  to  a  .in  file  and  submitted  to 
DivCon.  We  used  the  PM3  operator,  the  SCRF  solvation  method,  a  nine-angstrom  non-bonded  cut-off, 
and  the  cluster  sub-setting  (ncore=l  dbuffl  =5.5  dbuff2=2.5).  The  DivCon  SPCs  were  run  in  triplet  in 
order  to  get  a  binding  energy  (AH  =  Hcoinpicx-  (HreCcptor  +  H|lgand)  )  for  each  snapshot.  These  energies  were 
then  averaged  to  give  us  a  binding  energy  for  a  particular  ligand. 


Figure  6:  Crystal  structure  of  the  alpha  estrogen 
receptor  with  bound  inhibitor.  Purple  indicates  full 
protein,  and  light  blue  is  the  truncated  model  system. 


Results  and  Discussion. 


The  MD  simulations  for  the  large  protein  system  showed  that  the  aER  LBD  remained  fairly  rigid 
throughout  the  production  run.  However,  there  was  more  conformational  flexibility  seen  in  the  ligand’s 
trajectory,  with  some  trajectories  being  more  stable  than  others.  Using  the  RMS  data,  we  calculated  free 
energy  of  binding  for  the  equilibrated  ligand-protein  complexes.  Our  initial  MMGBSA  calculations  from 
the  large  protein-ligand  complex  gave  us  basically  no  correlation  with  experimental  data,  as  shown  in 
Figure  7.  These  poor  results,  combined  with  the  high  computational  expense,  encouraged  us  to  try  and 
simplify  our  system. 
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MMGBSA  Free  Energy  of  Binding  v.  Relative  Binding  . 
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Figure  7:  Correlation  of  theoretical  energies.  Top  left  MMGBSA  energies  for  small  system  plotted  against  DivCon  energies 
for  small  system.  Bottom  left  MMGBSA  energies  for  small  system  plotted  against  large  system  MMGBSA  energies.  Right 
side  displays  AG  (top)  and  AH  (bottom)  MMGBSA  and  DivCon  energy  results  as  a  function  of  experimental  relative  binding 
affinity.' 3  _ 


Qualitative  observation  of  the  MD  simulations  showed  a  significant  stability  for  the  majority  of 
the  protein.  This  led  us  to  believe  that  a  significant  reduction  in  the  size  of  the  protein  would  not  affect 
the  protein-ligand  interaction  that  we  are  attempting  to  model.  After  cutting  down  the  protein  to 
approximately  one  third  of  its  original  size,  and  restraining  the  exterior  residues,  we  ran  MD  simulations 
and  compared  them  to  the  large  protein  simulations.  To  understand  the  qualitative  differences,  we 
compared  the  different  raloxifene  simulations  by  superimposing  the  small  system  on  the  larger  system  in 
VMD  and  then  watching  the  simulations  progress  simultaneously.  Figure  8  shows  snapshots  from  this 
procedure,  at  the  beginning,  middle  and  end.  In  general,  it  appeared  that  the  small  system  held  the 
general  shape  of  the  binding  cavity,  and  that  raloxifene  seemed  to  have  a  very  similar  trajectory  in  both 
proteins.  Based  on  this,  we  assumed  that  the  cut  protein  system  was  a  reasonable  substitution  for  the 
larger  protein,  and  we  could  use  these  simulations  to  determine  the  energetics  of  ligand-protein 
interaction.  However,  when  we  calculated  free  energy  of  binding  using  first  MMGBSA  and  then  change 
in  enthalpy  using  DivCon,  our  results,  again,  did  not  correlate  with  the  experimental  data  (Table  1).  In 
order  to  better  understand  the  discrepancy  in  our  data,  we  decided  to  look  more  closely  at  what  was 
happening  in  the  MD  simulations. 
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Figure  8:  Overlapping  MD  simulation  snapshots  of  aER-raloxifene  (blue)  and  our  modified  aER-raloxifene  (pink). 
To  the  far  left  is  the  MD  simulations  at  0  ps,  middle  at  1000  ps,  right  at  2000  ps. 


Table  1:  Experimental  relative  binding  affinity  (REA)  data  and  theoretical  energies  determined  using 
MMGBSA  and  DivCon. 

small  system  large  system 


RBA 

MMGBSA 

DivCon 

MMGBSA 

(experimental) 

^  G  teal/  mol 

(std) 

uH(kcal/mol) 

dg 

kealmol  (Std) 

9J 

0.005 

-56. 24 

(3.4) 

20.17 

-49.8 

(4.0) 

5b' 

0.008 

-54. 78 

(5.6) 

21.12 

-49.4 

(4.6) 

23c 

0.012 

-47.9 

(5.7) 

31.58 

-56.2 

(4.7) 

16b 

0. 056 

-55.61 

(3.7) 

26.51 

-51.7 

(4.1) 

5b 

0.073 

-53.58 

(4.1) 

28.02 

-52.6 

(4.2) 

23ja 

0.100 

-51.79 

(3.6) 

24.82 

-51.9 

(4.0) 

up 

0.130 

19.56 

-58.1 

(4.7) 

ralox 

0.340 

-51.07 

(3.1) 

25.54 

-55.1 

(3.7) 

lid 

0.400 

-54.63 

(4.1) 

27.54 

The  Ptraj  RMS  data  on  the  ligands  (data  not  shown)  allowed  us  to  quantitatively  examine  the 
movements  of  the  different  ligands.  If  we  examine  just  the  large  protein  MD  runs,  we  see  that  the 
strongest  binders,  as  predicted  by  RBA,  show  the  most  stable  trajectories.  Raloxifene  and  tamoxifen 
show  an  initial  jump  in  RMS,  but  stabilize  by  200  and  300  ps  respectively.  The  best  binder,  1  Id,  shows 
almost  no  fluctuation  from  the  very  beginning  of  the  simulation.  The  less  potent  raloxifene  analogs  show 
more  varied  trajectories  suggesting  possibly  a  less  tight  fit.  If  we  look  at  these  same  ligands  in  the 
smaller  protein  system,  we  see  different  results.  Several  of  the  weak  binders  show  very  stable  trajectories 
(9j,  5b’,  16b),  while  raloxifene  and  1  Id  seem  to  have  more  fluctuation.  If  we  compare  the  two  different 
simulations  for  the  same  ligand,  we  see  some  important  differences  in  the  strong  binders.  Raloxifene 
jumps  up  to  a  higher  RMS  value  around  3A  and  stays  there  in  the  large  protein,  while  in  the  small  protein 
raloxifene’s  RMS  values  are  far  more  varied  but  most  of  the  time  are  between  1 A  and  2A.  The  opposite 
is  true  of  the  experimentally  best  binder,  1  Id,  whose  large  protein  simulation  keeps  the  ligand  at  or  below 
1 A  RMS,  but  in  the  small  system  the  ligand  stays  at  approximately  2. 5  A,  but  has  more  fluctuations.  The 
RMS  values,  by  themselves,  are  not  as  telling  because  they  are  dependent  on  how  good  the  initial  fitted 
structure  is.  The  part  of  the  ligand  that  moves  the  least  is  the  part  tucked  up  into  the  protein,  and  consists 
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primarily  of  three  aromatic  rings  that  are  fairly  rigid.  The  part  of  the  ligand  that  rotates  and  provides 
most  of  the  movement  is  the  “tail”  that  has  six  rotatable  bonds  and  exposed  to  bulk  solvent.  To  further 
understand  the  MD  simulations  of  these  cut  protein  systems,  we  looked  at  the  protonated  amine  group  on 
the  tail  section  of  the  ligand. 

Surprisingly,  the  protonated  amine  invariably  moves  away  from  ASP35 1  and  hydrogen  bonds  to 
other  oxygen  atoms  throughout  the  binding  pocket,  as  well  as  one  nitrogen  and  one  sulfur  (see  Figure  9). 
The  protonated  amine  for  the  majority  of  the  simulations,  forms  an  m/ramolecular  hydrogen  bond.  The 
tail  of  the  ligand  folds  back  onto  itself  and  it  able  to  hydrogen  bond  to  an  oxygen  atom  in  the  tail  (see 
Figure  10).  The  protonated  amine  tail  has  a  pKa  of  approximately  four,  which  means  that  when  it  is 
isolated  in  solution,  at  a  physiological  pH,  the  majority  of  the  ligands  would  not  be  protonated.  This 
intramolecular  hydrogen  bond  that  we  see  predominate  in  most  of  the  simulations  is  likely  an  artificial 
result.  This  is  a  very  difficult  issue,  since  the  protonation  state  of  the  ligand's  tail  is  likely  to  depend  on 
dynamics.  Because  the  tail  portion  of  the  ligand  is  solvent  accessible,  we  expect  that  the  amine  tail  is 
protonated  in  the  right  environment,  such  as  in  proximity  to  ASP351,  but  unprotonated  when  completely 
solvated  by  bulk  water.  At  best,  it  is  uncertain  that  the  amine  portion  of  the  ligand  would  be  protonated 
in  all  of  the  conformations  visited  by  the  MD  simulations.  This  result  lead  us  in  several  different 
directions  looking  for  an  explanation. 


Figure  9:  (Right  )The  large  protein  simulations  show  less  broad  interactions  between  the  protonated  amine  tail  of  the 
ligand  and  the  binding  cavity  residues  when  compared  to  the  (Left)  small  protein  simulations.  The  atoms  on  the 
residues  that  interact  with  the  proton 


In  some  of  the  simulations,  the  initial  ligand  position  obtained  using  AutoDock  Tools  did  not 
place  the  hydrogen  atom  close  enough  to  the  aspartic  acid  to  be  within  hydrogen  bonding  distance.  We 
assumed  that  the  MD  simulation  would  form  the  hydrogen  bond  over  the  course  of  the  simulation,  but 
this  was  not  the  case.  However,  even  in  the  simulations  that  began  with  that  hydrogen  bond  intact,  the 
ligand  ultimately  changed  conformation.  We  first  tried  to  ascertain  whether  or  not  the  binding  mode 
includes  the  hydrogen  bond  to  ASP351,  as  is  suggested  in  the  1ERR  crystal  paper.14  We  searched  the 
Protein  DataBank  for  all  crystal  structures  with  the  aER  LBD  bound  to  a  ligand.  We  selected  12  crystals 
with  ligands  similar  to  raloxifene  or  estrogen  and  overlaid  them  in  VMD.  Figure  1 1  shows  the 
remarkably  close  conformation  adopted  by  all  of  the  ligands,  even  with  all  of  the  molecular  differences. 
This  seems  to  provide  additional  support  for  that  particular  binding  mode,  with  the  hydrogen  bond 
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between  ASP35 1  and  a  proton  donor  on  the  ligand.  There  are  two  possibilities  for  why  our  simulations 
disrupt  this  bond:  the  force  fields  that  we  are  using  are  inadequately  representing  the  energetics  of  the 
hydrogen  bond,  or  the  crystal  structure  is  only  accurate  for  the  solid  phase  complex,  and  is  unrealistic  at 
room  temperature  in  solution.  We  plan  to  test  the  second  hypothesis  by  calculating  energies  based  solely 
on  the  MD  snapshots  that  maintain  the  hydrogen  bond  with  ASP35 1 . 


Figure  10:  Example  conformer  pulled  from  the  raloxifene  and  cut  protein  MD 
simulation  showing  how  the  protonated  raloxifene  curls  back  on  itself  and  makes  an 
intramolecular  hydrogen  bond. 
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Since  we  have  very  poor  correlation  to  experimental  data  using  two  different  energy  calculations, 
we  compared  our  theoretical  results  to  each  other.  In  Figure  7  we  first  show  the  MMGBSA  results 
plotted  against  the  DivCon  results  for  the  small  system.  The  two  different  methods,  and  different  energy 
quantities,  show  reasonable  positive  correlation.  That  is  to  say  that  both  methods  agree  on  the  ligands 
that  are  strong  binders,  and  those  that  are  poor  binders. 

Recent  Results  and  Discussion. 

Recently,  we  have  obtained  positive  correlation  between  our  calculated  ensemble  PM3  energies 
and  the  experimental  RBA  values  determined  by  Grese  and  coworkers.13  This  was  accomplished  thanks 
to  two  changes  in  our  protocol.  First,  we  used  the  smaller  model  protein  to  approximate  the  full  ER. 
Second,  we  immersed  the  whole  system  in  an  octahedral  box  of  TIP4P  waters.  The  force  fields  and 
simulations  protocols,  aside  from  the  changes  required  for  an  explicit  periodic  boundary  simulation,  are 
identical  to  those  outlined  above.  Figure  12  displays  the  binding  energy  (AH°298k)  versus  the  natural  log 
of  the  RBA  of  several  ligands,  along  with  the  best-fit  line  and  its  R2  value.  (If  we  remove  the  bottom 
most  right  point,  our  correlation  goes  above  R2=0.7)  The  existence  of  this  correlation  is  very  promising, 
and  suggests  several  things  concerning  our  development  of  a  computational  assay.  First,  that  our  model 
of  the  estrogen  receptor  (ER)  protein  is  sufficient  for  modeling  the  important  interactions  between  the  ER 
and  the  ligands.  Second,  our  protocols  for  determining  the  ligand's  partial  atomic  charges  and  the 
docking  of  these  ligands  into  the  binding  site  are  satisfactory  for  the  MD  simulations.  Third,  the 
simulations  are  sufficiently  sampling  the  local  potential  energy  surface  for  obtaining  an  ensemble  of 
conformations.  Finally,  that  the  semiempirical  PM3  method  as  implemented  in  DivCon  can  be  used  to 
provide  binding  energies  that  when  averaged  over  an  ensemble  of  conformations  corresponds  well  to 
experimental  RBA  trends. 


In  RBA  (exp.) 


Figure  12:  PM3  ensemble  binding  energies  (kcal/mol)  versus  the  natural  log  of 
the  experimental  RBAs  from  several  ligands  reported  in  reference  13.  The  best  fit 
line,  its  equation,  and  the  /<“'  factor  are  also  present. 


Currently  we  are  continuing  our  analysis  of  each  octrahedral  box  MD  simulation  in  order  to 
understand  the  details  of  each  of  the  ligands  interactions  with  the  protein.  We  hope  to  be  able  to  isolate 
specific  interactions  that  can  be  used  to  create  promising  molecular  leads  for  drug  discovery.  We  also 
hope  to  improve  the  correlation  by  including  more  ligands,  and  by  enlarging  the  ensemble  that  is  used  in 
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determining  the  binding  energy  for  each  point.  We  hope  to  obtain  an  R2  value  above  0.8. 


#  K 


Figure  13:  (Left)  Octahedral  box  surrounding  the  small  ER  model  of  the  protein.  (Right)  Detailed  view  of  the  ligand  in 
the  ER  binding  domain  with  important  mediating  waters  distinguished  in  gold. 


A  few  interactions  are  readily  identified  as  important  for  stabilizing  the  interaction  between  the 
protein  and  the  ligand.  A  few  important  water  molecules  bridge  the  ligands  to  the  protein.  The  presence 
of  waters  around  arginine  and  glutamic  residues  in  the  binding  site  helps  to  maintain  a  very  strong 
hydrogen  bond  interaction  between  glutamic  acid  and  the  ligands'  6-OH.  (Figure  1 3)  Normally,  the 
arginine  and  the  glutamic  acids  formed  a  strong  hydrogen  bond  in  our  previous  implicit  solvations. 
During  these  implicit  solvation  simulations,  the  hydrogen  bond  between  the  ligands'  6-OH  group  and  the 
glutamic  acid  is  initially  present,  but  is  quickly  lost  for  the  remainder  of  the  situation.  In  these  explicit 
simulations  the  waters  disrupt  this  amino  acid-amino  acid  hydrogen  bond,  allowing  the  glutamic  acid- 
ligand  hydrogen  bond  to  be  long  lived.  Another  interesting  fact  is  that  the  ligand  tail  remains  completely 
solvated  during  these  explicit  water  solvations.  In  the  implicit  solvations  simulations,  the  tail  of  the 
ligand  had  great  mobility  and  made  hydrogen  bond  contacts  with  the  protein  in  several  locations.  In  the 
explicit  solvation  the  tail  gains  stability  in  its  motion,  which  appears  to  help  our  determination  of  the 
ensemble  averaged  enthalpy  of  bindings. 

In  summary,  we  have  created  a  computational  assay  for  the  binding  of  ligands  to  the  estrogen 
receptor.  Figure  14  displays  our  current  methodology  for  running  a  ligand  through  our  computational 
assay. 
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Optimize  ligand  (HF-6-31G*),  assign  partial  atomic 
charges,  atom  types  using  Antechamber. 


G 


) 


Dock  the  ligand  into  the  empty,  sander  minimized  whole 
ER  LBD  from  3ERT  crystal  using  Autodock 


(j 


Select  pose  that  fits  most  dosely  to  raloxifene's  crystal  (1ERR). 
This  is  accomplished  by  determiningthe  RMSdfit  using  ProFit 
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Assign  parameters  (GAFF  for  the  ligand,  and  modified  Parm99  for  the  protein), 
solvate  the  system  in  an  octahedral  box  of  TIP4P  waters  using  XleAP. 
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Perform  an  constant  volume  MD  simulation  for  50  picoseconds,  heating  from 
5K  to  300K,  and  then  equilibrate  for  an  additional  50  picosecond  using  Sander. 


0 


Perform  a  constant  pressure  MD  simulation  for  5000  picoseconds  using  Sander.  ) 


(^Use  Ptraj  to 


remove  all  of  the  waters  from  the  MD  simulation. 
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Locate  regions  where  the  ligand  possesses  dynamic  stability  in 
the  MD  simulation  using  RMS  data  generated  by  Ptraj. 
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C Extract  snapshots  out  every  50  psec  from  selected  \ 
dynamically  stable  regions  In  the  MD  slmultalon.  ) 


(Run  \ 


single  point  PM3  calculations  on  the  extracted  complexes,  then  the  receptor 
and  ligand  to  obtain  aH  of  binding,  This  Is  done  using  DivCon. 


) 


Figure  14  Flow  diagram  of  determination  of  the  enthalpy  of  binding  of  a  ligand 
to  the  estrogen  receptor. 


Key  Research  Accomplishments 

•  Development  of  a  computational  model  of  the  Estrogen  Receptor  binding  domain. 

•  Performed  several  long  molecular  dynamics  simulations  (800  ps  -  3  ns)  on  the  ligand-ER  system 
using  ligands  with  known  relative  binding  affinities. 

•  Determined  the  optimal  method  for  correlating  binding  energies  to  experimental  relative  binding 
affinities. 

•  Creation  of  a  computational  assay  for  ligand  binding  that  involves  combining  molecular  dynamics 
and  quantum  mechanics 
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Pickard  IV,*  Chantelle  Rein,*  Steven  Feldgus,  Karl  N.  Kirschner  and  George  C.  Shields,  in 
preparation  for  Journal  of  Physical  Chemistry  B  . 

2.  "Computational  Design  and  Experimental  Discovery  of  an  Anti-estrogenic  Peptide  Derived  from 
Alpha-Fetoprotein"  Karl  N.  Kirschner,  Katrina  W.  Lexa,*  Amanda  M.  Salisburg,*  Katherine  A. 
Alser,*  Leroy  Joseph,  Thomas  T.  Andersen,  James  A.  Bennett,  Herbert  I.  Jacobsen,  and  George 

C.  Shields,  in  preparation  for  Nature. 

3.  "The  Nature  of  the  Dissociation  of  H20  in  Water"  Sarah  T.  Tschampel,*  Frank  C.  Pickard  IV,* 
Jennifer  R.  Derby,*  and  George  C.  Shields,  in  preparation. 

4.  "In  Search  of  CS2(H20)„,  n=l-4  Clusters"  Gregory  M.  Hartt,*  Timothy  M.  Evans,*  Karl  N. 
Kirschner  and  George  C.  Shields,  J.  Chem.  Phys.  submitted  9/6/06,  in  revision. 

5.  "Do  Hydroxyl  Radical-Water  Clusters,  OH(H20)„,  n=l-5,  Exist  in  the  Atmosphere?"  Marco  A. 
Allodi,*  Meghan  E.  Dunn,*  Jovan  Livada,*  Karl  N.  Kirschner  and  George  C.  Shields,  J.  Phys. 
Chem.  A  accepted  for  publication  9/28/2006. 

6.  "Comparison  of  CCSD(T),  W 1 ,  and  other  Model  Chemistry  Predictions  for  Gas  Phase 
Deprotonation  Reactions"  Frank  C.  Pickard  IV,*  Daniel  R.  Griffith,*  Skylar  J.  Ferrara,*  Matthew 

D.  Liptak,*  Karl  N.  Kirschner  and  George  C.  Shields,  Int.  J.  Quantum  Chem.  106  (2006)  3122- 
3128. 

7.  "Exploration  of  the  Potential  Energy  Surfaces,  Prediction  of  Atmospheric  Concentrations,  and 
Vibrational  Spectra  of  the  H02***(H20)n  (n=l-2)  Hydrogen  Bonded  Complexes"  Kristin  S. 
Alongi,*  Theodore  S.  Dibble,  George  C.  Shields  and  Karl  N.  Kirschner,  J.  Phys.  Chem.  A  1 10 
(2006) 3686-3691. 

8.  "Computational  design  of  a  small  peptide  that  inhibits  breast  cancer:  An  overview  of 
computational  chemistry  research  at  Hamilton  College."  38th  ACS  Middle  Atlantic  Regional 
Meeting,  June  5,  2006.  Presented  by  George  C.  Shields 

9.  “Analysis  of  the  Affinity  of  Raloxifene  and  its  derivatives  to  the  a-Estrogen  Receptor  Ligand 

*  * 

Binding  Domain,”  Karilyn  Larkin,  Amy  Barrows,  Karl  Kirschner  and  George  Shields  at  the 
46th  annual  Sanibel  International  Symposium  on  Atomic,  Molecular,  Biophysical,  and  Condensed 
Matter  Theory,  March  2006,  St.  Simons,  GA.  Presented  by  K.  Larkin  and  won  the  Outstanding 
Poster  Presentation  Award. 

10.  “Generalized  Born  Molecular  Dynamics  and  MMPBSA  Analysis  of  the  a-Estrogen  Receptor 

*  * 

bound  to  Raloxifene,  Tamoxifen  and  Estradiol,”  Amy  Barrows,  Karilyn  Larkin,  Karl 
Kirschner,  and  George  C.  Shields  at  the  Fourth  MERCURY  Conference  in  Computational 
Chemistry,  July  2005,  Clinton,  NY.  Poster  presented  by  A.  Barrows. 
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1 1 .  “Analysis  of  the  Affinity  of  Raloxifene  and  its  derivatives  to  the  a-Estrogen  Receptor  Ligand 

*  * 

Binding  Domain,”  Karilyn  Larkin,  Amy  Barrows,  Karl  Kirschner  and  George  Shields  at  the 
Fourth  MERCURY  Conference  in  Computational  Chemistry,  July  2005,  Clinton,  NY.  Presented 
by  K.  Larkin. 

12.  Karilyn  Larkin  -  Chemistry  Thesis:  “Analysis  of  the  Affinity  of  Raloxifene  and  its  Derivatives  to 
the  a-Estrogen  Receptor  Ligand  Binding  Domain:  A  Model  for  Drug  Design"  2005-2006. 
Currently  pursuing  an  M.D.,  Albany  Medical  School,  NY. 

Conclusion 

We  have  successfully  combined  MD  &  QM  methods  in  an  innovative  way  to  develop  a 
computational  assay  that  will  allow  for  the  development  of  novel  lead  compounds  that  should  be  active 
against  breast  cancer.  We  have  determined  the  optimal  method  for  correlating  binding  energies  to 
experimental  relative  binding  affinitites  (RBAs).  This  assay  will  be  used  to  design  novel  inhibitors  with 
high-predicted  activity.  Examining  the  dynamics  of  the  ligands  that  exhibit  high  affinity  versus  those  that 
do  not,  we  will  be  able  to  determine  the  important  ligand-protein  interactions.  This  will  allow  for  the 
specific  modifications  to  be  made  to  this  ligands  in  hopes  of  enhancing  the  binding  affinities.  The  new 
ligands  will  be  tested  in  our  computational  assay,  and  the  promising  ones  will  be  synthesized  and  tested 
in  experimental  ER  assays. 


Future  Directions 

We  will  continue  to  refine  our  assay,  by  including  more  MD  data  points  in  our  determination  of 
the  enthalpy  of  binding.  We  will  also  investigate  the  protonation  state  of  the  amine  on  the  tail  of  our 
analogues.  Now  that  we  observe  that  the  tail  stays  solvated  during  the  simulation,  we  believe  that  the 
amine  will  exist  in  the  unprotonated  state.  It  is  unlikely  that  this  will  alter  the  correlation. 

We  will  perform  our  computational  protocol  on  new  ligands  from  the  Renaud  paper19  for 
protonated  and  unprotonated  runs.  Future  work  will  involve  a  collaboration  with  medicinal  chemists  who 
will  synthesize  the  new  molecules,  test  them  in  an  ER  assay,  and  evaluate  the  most  promising  compounds 
against  the  MCF-7  breast  cancer  cell  line. 
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NSF-MRI  9/1/01-8/31/04  “Acquisition  of  High  Performance  Computers  for  the  Northeastern 
Undergraduate  Research  Chemistry  Consortium” 

MERCK/AAAS  Undergraduate  Research  Program,  7/1/00-6/30/03,  (w/  J.  Garrett,  Biology) 
“Merck/AAAS  Hamilton  Summer  Research  Scholars  Program” 

Camille  and  Henry  Dreyfus  Scholar/Fellow  Program,  7/1/00-9/30/02,  “Quantum  Biology  & 
Computational  Biochemistry” 

NIH-AREA,  6/15/99-7/14/02,  “Computational  Design  of  Haptens  to  Destroy  Cocaine” 

American  Chemical  Society/Petroleum  Research  Fund  7/1/98-8/31/00  “Catalytic  Antibodies: 
Correlation  of  Hapten  Structure  with  Induction  of  Catalysis” 

North  Atlantic  Treaty  Organization  1/1/98-12/31/99,,  Collaborative  Research  Grant  between  the 
NATO  countries  Spain,  England,  and  the  United  States.  “Studies  on  DNA  (t)  Structure  and  Reactivity” 

Spanish  Ministry  of  Science  and  Education,  6/1/95-5/30/96, ,  Sabbatical  Research  Award 

National  Science  Foundation  1994-1996  “Development  of  Computational  Chemistry  and 
Computer  Graphics  Laboratory  Experiments  and  Manuals” 

NIH-AREA  1992-1995  “Cloning  and  Crystallization  of  Isoschizomers  of  Pst  I” 

Research  Corporation  1990-1992  “Crystallization  &  Characterization  of 
Circumsporozoite  Proteins  of  the  Malaria  Parasite  P.  falciparum" 
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Since  1990 1  have  received  -  in  external  grant  support  for  20  proposals  funded  by  NSF,  N1H, 
ACS/PRF,  Dreyfus  Foundation,  Research  Corporation,  and  other  funding  agencies. 


HONORS  AND  AWARDS 


2005 

2003 

2000-02 

1995-96 

1993 

1985 

1984 

1984 

1983 

1980 

1978 

1977 


Editorial  Board  of  Computing  Letters 
Winslow  Chair  in  Modern  Science 
Camille  and  Henry  Dreyfus  Scholar 

Spanish  Ministry  of  Education  and  Science  Sabbatical  Award 

William  L.  Dunn  Award  for  Outstanding  Teaching  and  Scholarly  Promise,  awarded  by  an 
elected  committee  of  faculty  at  LFC 

Conoco/Du  Pont  Fellowship,  awarded  by  GT  Chemistry  Faculty 

Elected  full  member  of  Sigma  Xi  Research  Society 

Sigma  Xi  Outstanding  Master’s  Thesis  Award  in  Science 

Omicron  Delta  Kappa  (ODK),  National  Leadership  Honorary  Society 

Coaches  Award  in  Lacrosse,  for  Leadership  On  and  Off  the  Field 

Full  Four  Year  Tuition  Baker  Scholarship  Recipient 

New  York  State  Regents  Scholarship 


UNDERGRADUATE  TEACHING 


Hamilton  College 

Physical  Chemistry;  Physical  Chemistry  Laboratory;  Principles  of  Chemistry;  Biological  Chemistry; 
Research. 


Lake  Forest  College 

Physical  Chemistry  1989-1998;  Physical  Chemistry  Laboratory  1989-1998;  Principles  of  Chemistry 
1991-1998;  Topics  in  Biochemistry  1990-1998;  Research  &  Senior  Thesis  in  Chemistry  1989-1998; 
General  Chemistry  Laboratory  1989-1990,  1993;  Principles  of  Chemistry  Project  Laboratory  1991-1993; 
Co-director,  Richter  Apprentice  Scholar  Summer  Program  1992,  1994. 


NA  TIONAL  SPONSORED  STUDENT  A  WARDS  -  Since  1995  my  students  have 
received  twelve  national  awards,  including  two  Fulbrights  and  six  Goldwaters. 

Barry  M.  Goldwater  Scholarship,  Daniel  Griffith,  2006-07, 

Barry  M.  Goldwater  Scholarship,  Mary  Beth  Day,  2005-07, 

Barry  M.  Goldwater  Scholarship,  Meghan  Dunn,  2005-06, 

Fulbright  Grant,  Spain,  Sarah  Taylor,  2003-04,  living  expenses  for  a  year  in  Barcelona 
Pfizer  Summer  Undergraduate  Research  Fellow,  Matthew  Liptak,  2002, 

Barry  Id.  Goldwater  Scholarship,  Matthew  Liptak,  2002-03, 

Barry  M.  Goldwater  Scholarship,  Lorena  Hernandez,  2001-03, 

Barry  M.  Goldwater  Scholarship,  Ann  Marie  Toth,  2001-02, 

ACS  Scholars,  Lorena  Hernandez,  1 999-2002, 

Gates  Millennium  Scholar,  Lorena  Hernandez,  2000-01, 

Fulbright  Grant,  England,  Edward  Sherer,  1 997-98,  living  expenses  for  a  year  in  Nottingham 
CUR  Summer  Research  Fellowship,  Gordon  Turner,  1995,  (declined) 
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UNDERGRADUATE  RESEARCH  DIRECTION  —  Since  1989 1  have  supervised  a  total  of  110 
undergraduates  (58  F)  and  3  high  school  students  in  research  projects.  Of  49  graduates  (27F),  21  have 
entered  PhD  programs  (10F);  two  entered  Law  School  (2F);  thirteen  entered  professional  schools  for 
health  professions  (9F),  seven  entered  MS  or  MAT  programs  (4F);  and  three  are  secondary  school 
teachers  (F). 

Hamilton  College  -  supervised  92  undergraduates  (49  F)  &  3  secondary  school  students 


Graduates  Of  31  HC  graduates  (18  F),  11  entered  Ph  D.  programs  (6F);  two  entered  Law 

School  (2F);  10  entered  professional  schools  for  health  professions  (7F);  two  are 
high  school  chemistry  and  physics,  or  math  teachers  (F). 

2006 

Valery  Danilack,  Senior  Thesis  in  biochemistry,  "Testing  the  Estrogen  Receptor  Binding  Affinity 
of  Pharmacophores  of  Raloxifene  in  Search  of  Anti-Breast  Cancer  Drugs"  M.S.  Public  Health, 
Yale  University 

Ashley  Deline,  Senior  Thesis  in  chemistry,  "  Molecular  Dynamic  Simulations  of  a  Glycoform  and 
its  Constituent  Parts  Related  to  Rheumatoid  Arthritis"  Fred  Hutchingson  Cancer  Center 

Meghan  Dunn,  Senior  Thesis  in  chemistry,  "Hydrated  N2  and  02  Complexes:  Implications  of 
Weakly  Bound  Species  in  Atmospheric  Chemistry"  Ph.D.  Chemistry,  University  of  Colorado, 
Boulder. 

Karilyn  Larkin,  Senior  Thesis  in  chemistry,  "Analyses  of  the  Affinity  of  Raloxifene  and  its 
Derivatives  to  the  a-Estrogen  Receptor  Ligand  Binding  Domain:  A  Model  for  Drug  Design" 

M.D.  Albany  Medical  School 

Amber  Gillis,  Biochemistry,  MAT  Chemistry  &  Biology,  Union  College 
Rebecca  Shepherd,  Biology,  work. 

2005 

Timothy  Evans,  Senior  Thesis  in  chemistry,  "Hydrated  Complexes  of  Atmospheric  Sulfur 
Species"  M.S.  Mathematics,  NYU 

JunChan  Hong,  Chemistry,  Analyst  at  UBS  Financial  Services 

Scott  Huntington,  Senior  Thesis  in  chemistry,  "Computer  Aided  Design  of  Selective  Estrogen 
Receptor  Modulators",  NYU  Medical  School 

David  Kelland,  Futures  Trader,  Gelber  Group 

Katrina  Lexa,  Senor  Thesis  in  chemistry,  "Conformational  Analysis  of  Peptides:  Modeling  a 
Breast  Cancer  Inhibiting  Peptide  derived  from  AFP",  M.S.  Public  Health,  U.  Michigan;  Ph.D. 
Medicinal  Chemistry,  U.  Michigan. 


Frank  Pickard,  Senior  Thesis  in  chemical  physics,  "The  Enediyne  Anticancer  Antibiotics:  A 
Study  of  the  Bergman  Cyclization  Energy  Barriers  of  Esperamicin  Ai"  Research  Assistant, 
Hamilton  College;  Ph.D.  Physical  Chemistry,  U.  Georgia. 
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Hima  Poonati,  Chemistry,  St.  George's  Medical  School. 

Andreu  Viader  Vails,  Biochemistry,  Ph.D.  Biochemistry,  Washington  University 
2004 

Gabrielle  Markeson,  Senior  Thesis  in  chemistry,  “Anti-Breast  Cancer  Drug  Design”  Eltron 
Research  in  Boulder,  CO.  Fordham  Law  School,  2005. 

Emma  Pokon.  minor  in  Chemistry,  Law  School,  University  of  Vermont. 

Brent  Matteson,  Chemical  Physics,  Graduate  School  at  Oregon  State  University 

Sinan  Misirli,  Senior  Thesis  in  chemistry,  “Theoretical  pKa  Calculations  of  Ionic  Species:  Implicit 
and  Explicit  Solvation  Methods”  Work. 


2003 

Damien  Ellens,  Senior  Thesis  in  chemistry,  “Conformational  Analysis  of  Cyclic  Peptides: 
Evaluating  the  relationship  between  structure  and  energetics  using  Molecular  and  Quantum 
Mechanics”  Yale  Medical  School. 

Lorena  Hernandez,  biochemistry,  Goldwater  Scholar,  2001-02;  currently  MD/PhD  student  at 
Albert  Einstein  College  of  Medicine. 

Matthew  Liptak,  Senior  Thesis  in  chemistry,  “Modeling  the  Inhibition  of  Cdc25B:  Incorporating 
QM/MM  into  Rational  Drug  Design”  Seven  publications  from  his  undergraduate  research, 
publications  33,  34,  36,  38,  39,  43  &  48;  Goldwater  Scholar,  2002-03;  currently  graduate  student 
in  chemistry.  University  of  Wisconsin  (Tom  Brunold). 

Chantelle  Rein,  Senior  Thesis  in  biochemistry,  “An  Investigation  into  the  Usefulness  of  the 
ONIOM  QM/MM  Method  in  Determining  the  Energetic  Pathways  of  Esperamicin  A,”  Graduate 
school  in  biochemistry,  Oregon  Health  Sciences  University. 

Sarah  Taylor,  Senior  Thesis  in  biochemistry,  “  Computational  Approaches  to  Anti-Breast  Cancer 
Drug  Design”  Fulbright  Award  to  Barcelona,  Spain  (Modesto  Orozco,  2003-04).  Medical 
School  at  the  NYU  (2004). 

2002 

Beth  Hayes,  Senior  Thesis  in  chemistry,  “The  Mechanisms  of  DNA  Cleavage  by  Diradicals.” 
One  publication  from  her  undergraduate  research,  publication  #45;  Currently  enrolled  at  the  New 
York  College  of  Osteopathic  Medicine. 

Jaime  Skiba,  Senior  Thesis  in  chemistry,  “A  Computational  Analysis  of  the  AFP-derived 
Biologically  Active  Octamer  and  Some  Derivatives.”  One  publication  from  her  undergraduate 
research,  publication  #45;  High  School  Chemistry  &  Physics  Teacher  at  the  High  School  for 
Environment,  Business,  and  Technology  in  Williamsburg,  NYC. 


Annie  Toth,  biochemistry.  Publication  #33  from  her  1999-2000  research  year;  Goldwater 
Scholar,  2001-02;  currently  graduate  student  in  molecular,  cellular  and  developmental  biology  at  t 
he  University  of  California,  Santa  Barbara. 
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2001 


Jennifer  Derby,  Senior  Thesis  in  chemistry,  “The  Application  of  Electrostatic  Potential  to  Drug 
Design.”  Currently  working  as  an  Applications  Specialist  at  a  lab  equipment  company.  Thermo 
IEC. 

2000 

Sam  Bono,  Senior  Thesis  in  chemistry,  “The  Computational  Analysis  of  the  Biologically  Active 
8-mer  Peptide  of  Alpha-fetoprotein.”  Publication  #35  from  his  undergraduate  research;  O.D. 
Pennsylvania  College  of  Optometry. 

Ryan  Elias,  Senior  Thesis  in  chemistry,  “Modelling  the  Hydroxylase-Phenol  Component  of 
Methane  Monooxygenase:  A  Preliminary  Study  of  the  Effectiveness  of  DFT  Calculations.” 
Alternate  for  Fulbright  to  Italy.  Worked  for  Eziba  in  NYC.  Currently  graduate  student  in  food 
science  at  the  University  of  Massachusetts,  Amherst. 

Sarah  Tschampel,  Senior  Thesis  in  chemistry,  “The  Computational  Design  of  a  Transition  State 
Analog  for  the  Hydrolysis  of  Cocaine.”  One  publication  from  her  undergraduate  research, 
publication  #45;  Currently  graduate  student  in  chemistry  at  the  University  of  Georgia  (Rob 
Woods). 

James  Lempenau,  Senior  Thesis  in  biochemistry,  “Analysis  of  Netropsin  Binding  with  A-T  Rich 
DNA”  Currently  working  on  Wall  Street  (J.P.  Morgan). 

Lake  Forest  College  supervised  18  students  in  undergraduate  research  projects  ( 9F ');  ten 

entered  Ph.D.  programs  (4F);four  entered  MS  or  MAT  programs  (2F); 
three  entered  professional  schools  for  health  professions  (2F);  and  one 
entered  industry  (F). 


Graduates 

2000 

Mariela  Serrano,  Masters  of  Biochemical  Engineering,  University  of  Delft,  The  Netherlands. 

Becky  Turner,  completed  a  Master  of  Arts  in  Teaching  program  at  the  University  of  Maine. 
Currently  middle  school  math  and  science  teacher. 

1998 

Dale  Miller,  Honors  Senior  Thesis  “Purification  and  Crystallization  of  the  poly(dT) 
•poly(dA)-poly(dT)  DNA  Triplex  Complexed  with  the  Intercalator,  Damian’s  Salt.”  Enrolled  as  a 
biochemistry  graduate  student  at  Colorado  State  University. 

1997 

Edward  Sherer  -  Honors  Senior  Thesis  “Catalytic  Hydrolysis  of  Cocaine:  Mechanism  and 
Design.”  Fulbright  Graduate  Fellowship  (England),  1997-98;  Five  publications  from  his 
undergraduate  research,  publications  #25,  #26,  #28,  #29,  &#3 1 ;  Ph.D.  completed  at  the 
University  of  Minnesota  (Chris  Cramer).  Currently  Research  Scientist  at  Rib-X  Pharmaceuticals. 

Gordon  Turner  -  Honors  Senior  Thesis  “Computational  Design  of  New  Drugs  to  Treat  Cocaine 
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Addiction.”  Four  publications  from  his  undergraduate  research,  publications  #25,  #26,  #29,  & 
#31;  Ph.D.  in  physical  chemistry  completed  at  Yale  University  (Charles  Schmuttenmaer). 
Postdoctoral  research  fellow  at  Harvard  Medical  School,  in  the  Department  of  Radiology,  Center 
for  Molecular  Imaging  Research  (Vasilis  Nziachristos).  Currently  employed  by  Novartis. 

1996 

Jeff  Johnson  -  Summer  research  project  in  X-ray  crystallography.  Enrolled  as  a  chemistry 
graduate  student  at  the  University  of  Colorado,  Boulder. 

Nichole  Roberts  -  Summer  research  project  in  X-ray  crystallography.  M.S.  Biology,  University 
of  Colorado.  Currently  in  medical  school  at  Boston  University. 

1995 

Heather  Brummel  -  Honors  Senior  Thesis,  recipient  of  the  Phi  Beta  Kappa  Senior  Thesis  Award 
“Semiempirical  Study  of  the  Bergman  Reaction:  A  Computationally  Efficient  and  Accurate 
Method  for  Modeling  the  Enediyne  Anticancer  Antibiotics.”  Ph.D  Biochemistry,  University  of 
Colorado  at  Boulder  (Marvin  Caruthers).  Brummel’s  thesis  work  has  been  published,  publication 
#24.  Currently  Research  Scientist  at  Agilent  Tech  in  the  Bay  Area. 

Karl  Kirschner  -  Honors  Senior  Thesis,  “Quantum  Mechanical  Investigation  of  Adenosine 
3’,5’-Cyclic  Monophosphate.”  Ph.D.  Physical  Chemistry,  University  of  Georgia  (Phillip  Bowen). 
Kirschner  has  published  four  papers  based  on  his  undergraduate  research,  publications  #19,  #23, 
#27,  &  #28.  Currently  Visiting  Assistant  Professor  of  Chemistry  at  Hamilton  College. 

Tricia  Lively  -  Honors  Senior  Thesis,  “Design  of  an  Expression  System  for  Chlorella  XZ-6E 
CviRI  Methylase,  Protein  Purification  of  CviRI  Methylase,  Providencia  stuartii  1 64  PstI 
Endonuclease  and  Methylase  Proteins,  and  Crystallization  of  PstI  Endonuclease  with  a  20 
Basepair  Sequence  of  DNA.”  Ph.D.  completed  at  the  University  of  Colorado  at  Boulder  (Jim 
Goodrich).  Lively  has  published  two  papers  from  her  undergraduate  research,  publications  #22 
&  #29.  Currently  postdoc  with  Erwin  Gelfand  at  the  National  Jewish  Medical  and  Research 
Center  in  Denver. 

1993 

Roger  Sommer  (co-advised  with  M.L.  Thompson)  -  Honors  Senior  Thesis,  recipient  of  the  Phi 
Beta  Kappa  Senior  Thesis  Award  “Computational  and  Experimental  Approaches  to  the  Synthesis 
of  Polycyclic  Phosphazanes.”  Ph.D.  Inorganic  Chemistry,  University  of  Colorado  at  Boulder 
(Gordon  Yee).  Postodoctoral  associate  at  the  University  of  Delaware,  (Amie  Rheingold);  taught 
at  Middlebury  College;  currently  Assistant  Professor  of  Chemistry  at  Loyola  University. 

1992 

Ashish  Desai  -  Honors  Senior  Thesis  “Evolution  of  Lysozyme  and  Alpha-lactalbumin.”  MS 
Chem.,  University  of  Illinois  at  Chicago  (Albert  Benight).  MBA,  Texas  A&M. 

Graham  Haas  -  Senior  Project  “Hydrogen  Bonding  of  Formic  Acid:  A  FT-IR  Study.”  MS  Chem., 
Georgia  Tech  (Ken  Busch).  Employed  in  industry. 

Sarah  Gallagher  -  Summer  Research  in  Thermodynamics.  Ph.D.  Physical  Chemistry,  University 
of  Colorado  at  Boulder  (David  Jonas). 

1991 
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Kris  Henningfeld  -  Honors  Senior  Thesis  “Purification  of  Glyceraldehyde-3-Phosphate 
Dehydrogenase  from  Chicken  Muscle.”  Ph.D.  Chem.,  University  of  Virginia  (Sidney  Hecht). 
Currently  postdoctoral  associate  in  the  Department  of  Oncology /Hematology  at  the  University  of 
Ulm,  Germany. 

Tracey  Holmes  -  Honors  Senior  Thesis  “Purification  and  Characterization  of  Pigeon  Intestine 
Alkaline  Phosphatase.”  Currently  research  and  development  technician  at  Abbott  Laboratories. 

Marcus  Jurema  -  Honors  Senior  Thesis,  co-recipient  of  the  Phi  Beta  Kappa  Senior  Thesis 
Award  “An  Investigation  of  the  Applicability  of  the  PM3  Semi-empirical  Quantum  Mechanical 
Method  to  Hydrogen  Bonded  Systems.”  M.D.  Stanford  University;  further  training  at  Cornell 
Medical  School  and  Johns  Hopkins  Medical  School.  Jurema’s  thesis  work  was  published  in  three 
publications,  publications  #18  and  #19,  and  #22.  Currently  on  faculty  at  Brown  Medical  School 
as  a  Clinical  Assistant  Professor  of  Reproductive  Medicine  and  Infertility. 

1990 

Georgia  Kalkanis  -  Honors  Senior  Thesis  “Quantum  Mechanical  Calculations  of  the  Reactions  of 
CH2OH  with  02,  NO  and  N02.”  D.D.S.,  University  of  Detroit.  In  private  practice  in  Detroit. 
Kalkanis’s  thesis  work  was  published  in  1991,  publication  #16. 

INVITF.D  I  .F.CTI  FRF.S 

23.)  "Computational  Design  of  a  Small  Peptide  that  Inhibits  Breast  Cancer.”  Memorial  Sloan-Kettering 
Cancer  Center,  September  12,  2006.  Audience  40. 

22.)  "Computational  design  of  a  small  peptide  that  inhibits  breast  cancer:  An  overview  of  computational 
chemistry  research  at  Hamilton  College."  38th  ACS  Middle  Atlantic  Regional  Meeting,  June  5,  2006. 
Audience  25. 

21.)  "Computational  Design  of  a  Small  Peptide  that  Inhibits  Breast  Cancer."  Florida  State  University, 
March  23,  2006.  Audience  30. 

20.)  "Computational  Design  of  a  Small  Peptide  that  Inhibits  Breast  Cancer"  at  the  46th  International 
Symposium  on  Atomic,  Molecular,  Biophysical,  and  Condensed  Matter  Theory,  March  6,  2006,  St. 
Simons  Island,  GA.  Audience  125. 

19.)  "Water  Clusters  in  the  Atmosphere:  An  Overview  of  Computational  Chemistry  Research  at 
Hamilton  College."  Ithaca  College,  November  15,  2005.  Audience  25. 

1 8.)  "Water  Clusters  in  the  Atmosphere:  An  Overview  of  Computational  Chemistry  Research  at 
Hamilton  College."  Syracuse  University,  September  20,  2005.  Audience  30. 

1 7.)  "Water  Clusters  in  the  Atmosphere:  An  Overview  of  Computational  Chemistry  Research  at 
Hamilton  College."  University  of  Minnesota.  Minneapolis,  December  14,  2004.  Audience  100,  plus 
seminar  was  webcast  to  35  other  colleges  in  the  RSEC  program. 

16.)  “Hybrid  QM/MM  Models”  at  the  44th  International  Symposium  on  Atomic,  Molecular,  Biophysical, 
and  Condensed  Matter  Theory,  March  2004,  St.  Augustine,  Florida.  Audience  30. 

15.)  “Formation  of  MERCURY  to  Enhance  Undergraduate  Computational  Chemistry:  Accurate  pKa 
Calculations  in  Aqueous  Solution,  Progress  &  Challenges”  at  the  35th  ACS  Central  Regional  Meeting, 
Pittsburgh,  PA,  October  21, 2003.  Audience  35. 
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14.)  “An  Overview  of  Chemical  Physics  Research  at  Hamilton  College:  Accurate  pKa  Calculations  in 
Aqueous  Solution,  Progress  and  Challenges.”  University  of  Maryland,  College  Park,  April  16,  2003. 
Audience  40. 

13.)  “Career  Options  in  the  Field  of  Chemistry.”  Clinton  Central  High  School,  two  Honors  Chemistry 
Classes,  December  4,  2002.  Audience  24. 

12.)  “Basic  Research  and  Drug  Design:  Computational  Chemistry  at  Hamilton  College.”  Wright  State 
University,  October  1 8,  2002.  Audience  32. 

1 1. )  “Formation  of  MERCURY  and  Acquisition  of  High-Performance  Computers  to  Support  Research  in 
Undergraduate  Computational  Chemistry  across  New  York  and  New  England”  at  the  Executive  Summit, 
Delivering  Technology  Leadership  for  Life  Sciences:  Research  Advances  for  Drug  Discovery  and 
Bioterrorism.  A  Life  Sciences  Executive  Summit  co-sponsored  by  SGI,  the  Delaware  Biotechnology 
Institute  (DBI),  and  the  University  of  Delaware.  DBI,  October  3,  2002.  Audience  100. 

10.)  “Basic  Research  and  Drug  Design:  Computational  Biochemistry  at  Hamilton  College.”  Union 
College,  February  21,  2002.  Audience  40. 

9.)  “How  to  Succeed  in  College”  Marcellus  Academic  Club  Honors  Recognition  Dinner,  Marcellus, 
N.Y.,  March  25,  2001.  Audience  100. 

8.)  “One  Semester,  Accelerated  General  Chemistry  at  Hamilton  College.”  St.  Lawrence  University, 
December  7,  2000.  Audience  6. 

7.)  “An  Overview  of  Computational  Chemistry  at  Hamilton  College:  Drug  Design,  the  Nature  of  Water 
Dissociation,  and  Accurate  pKa  Calculations.”  St.  Lawrence  University,  December  7,  2000.  Audience 

12. 

6.)  “An  Overview  of  Computational  Chemistry  at  Hamilton  College:  Drug  Design,  the  Nature  of  Water 
Dissociation,  and  Accurate  pKa  Calculations.”  Vassar  College,  November  29,  2000.  Audience  32. 

5.)  “Chemical  Physics  in  the  Chemistry  Department.”  Hamilton  College,  Department  of  Physics, 
November  6,  2000.  Audience  25. 

4.)  “Computational  Chemistry  at  Hamilton  College:  Drug  Design,  the  Nature  of  Water  Dissociation,  and 
Accurate  pKa  Calculations.”  Connecticut  College,  October  24,  2000.  Audience  32. 

3.)  “Computational  Chemistry  at  Hamilton  College:  Drug  Design,  the  Nature  of  Water  Dissociation,  and 
pKa  Calculations.”  Holy  Cross  College,  September  29,  2000.  Audience  34. 

2.)  “An  Overview  of  Computational  Chemistry  at  Hamilton  College.  Can  quantum  chemistry  be  used  to 
calculate  accurate  pKa  values?”  Hobart  &  William  Smith  Colleges,  June  2,  2000.  Audience  65. 
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research  in  cancer  biochemistry. 

Met  with  Veronica  Vaida  at  the  University  of  Colorado  and  colleagues  at  NCAR 
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Abstract 

Esperamicin  A|  is  a  member  of  a  larger  family  of  naturally  occurring  products 
that  share  a  novel  molecular  architecture  and  potent  biological  activity.  The  purpose  of 
this  study  is  to  derive  and  evaluate  an  effective  model  compound  and  methodology  for 
the  computational  study  of  the  Bergman  cyclization  in  this  naturally  occurring  product. 
The  model  compound  presented  in  this  paper,  MTC2,  achieves  a  significant  reduction  in 
size  by  omitting  the  chemically  inert  oligosaccharide  arms  that  are  crucial  for 
esperamicin  Ai’s  ability  to  recognize  and  dock  to  DNA.  We  present  an  effective 
computational  methodology  for  the  study  of  Bergman  cyclization  in  small  and  moderate 
sized  enediyne  molecules. 

1.  Introduction 

Esperamicin  A|  (Figure  1)  is  a  member  of  a  larger  family  of  naturally  occurring 
products  that  share  a  novel  molecular  architecture  and  potent  biological  activity.  Other 
members  of  this  family  include:  dynemicin  A,  neocarzinostatin  and  calicheamicin  Tj,  The 


esperamicins  were  discovered  in  1985,'  isolated  from  a  fermentation  broth  of 
actinomadura  verrucosospora.  Esperamicin  was  named  for  the  place  where  the  bacteria 
responsible  for  the  production  of  this  compound  were  collected,  Pto  Esperanza,  in 
Argentina.  In  1987  the  structures  of  the  esperamicins  were  reported.2,3  All  four  of  these 
enediyne  compounds  have  drawn  significant  interest  as  potential  sources  of  anticancer 
drugs  because  of  their  ability  to  cleave  DNA,  presumably  through  a  Bergman 
cyclization.4  These  compounds  are  among  the  world’s  most  powerful  anti-tumor  agents, 
as  they  exhibit  high  activity  against  a  variety  of  murine  tumor  models  at  injected  doses  in 
the  0.1  pg'kg'1  range.5'6  However,  the  enediyne  natural  products  are  not  selective  in  their 
activity,  and  will  cleave  DNA  in  both  healthy  and  cancerous  cells.  This  indiscriminant 
behavior  has  spurred  a  flurry  of  both  experimental7  "and  computational12'18  research 
focused  upon  rational  drug  design  featuring  the  enediyne  moiety.  However,  the  exact 
mechanism  responsible  for  esperamicin  Afs  cytotoxicity  is  not  thoroughly  understood. 
This  paper  is  an  initial  step  towards  understanding  how  this  natural  product  may  cleave 
DNA  at  a  molecular  level. 


Figure  1.  The  structure  of  the  untriggered  esperamicin  Ai  molecule. 


In  the  most  commonly  accepted  mechanism  for  esperamicin’s  efficacy,  a 
bioreductuve  cleavage  of  the  trisulfide  tail  is  carried  out  by  glutathione  generating  a 
thiol.19  The  thiol  then  undergoes  an  intramolecular  addition  to  the  10-membered  ring 
containing  the  enediyne  moiety,  reducing  strain  and  increasing  flexibility  in  the  enediyne 
ring,20'22  which  significantly  lowers  the  activation  barrier  for  Bergman  cyclization 
(Figure  2).  This  cyclization  reaction  is  an  electronic  rearrangement  reaction  where  the 
1 ,5-diyne-3-ene  moiety  rearranges  to  form  a  1 ,4-didehydrobenzene  diradical.  Abstraction 
of  hydrogen  atoms  from  the  DNA  backbone  by  the  diradical  leads  to  single-stranded 
DNA  cleavage  and  eventual  apoptosis.  Recently,  Alabugin  et alP  proposed  an  alternate 
mechanism  for  radical  hydrogen  abstraction.  They  argue  for  the  feasibility  of  an 
intermediate  intramolecular  hydrogen  abstraction  step  that  would  occur  after  Bergman 
cyclization  and  before  the  hydrogen  abstraction  from  DNA.  This  intramolecular 
abstraction  would  essentially  ‘quench’  the  p-benzyne  intermediate  and  make  deactivation 
pathways  such  as  the  retro  Bergman  cyclization  inaccessible,  increasing  the  lifespan  of 
the  DNA  damaging  species. 
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Figure  2.  Schematic  of  the  Bergman  Cyclization  and  hydrogen  abstraction. 

The  purpose  of  this  study  is  to  derive  and  evaluate  an  effective  model  compound 
and  methodology  for  the  computational  study  of  the  Bergman  cyclization  in  naturally 
occurring  products,  specifically  esperamicin  Ai.  Because  of  the  extensive  size  of  the 


esperamicin  A|  molecule,  and  the  rigorous  theoretical  treatment  that  its  multi-reference 
electronic  state  demands,  we  have  chosen  to  work  with  a  series  of  smaller  models  that 
contain  esperamicin  A|’s  chemically  active  core.  The  model  compound  presented  in  this 
paper,  MTC2  (Figure  3),  is  a  variation  on  the  smaller  MTC  chemical  model  previously 
used  by  our  group  to  assist  in  the  study  of  the  Bergman  cyclization.24  The  MTC2  model 
achieves  a  significant  reduction  in  size  by  omitting  the  chemically  inert  oligosaccharide 
arms  that  are  important  for  esperamicin  Afs  docking  to  DNA.  Also,  since  this  study  is 
not  concerned  with  the  triggering  mechanism,  the  nitrogen  tail,  which  is  adjacent  to  the 
five-membered  sulfur  heterocycle,  has  also  been  pruned  to  further  reduce  the  size  of  the 
system.  These  omissions  reduce  the  system  size  from  one  hundred  sixty  atoms  to  forty 
atoms,  yet  the  MTC2  model  incorporates  esperamicin  Afs  multi-cyclic  ring  structure  and 
neighboring  substituent  groups  into  the  calculations. 


Figure  3.  The  MTC2  model  system  used  in  this  study.  Hydroxyl  groups  have  replaced  the  carbohydrates 
and  a  hydrogen-capping  atom  has  replaced  the  nitrogen  tail. 

2.  Methods 

The  Bergman  cyclization  is  a  difficult  reaction  to  model  because  of  the  multi- 
configurational  nature  of  the  diradical  species.  Reliable  wavefunction  descriptions  of  the 
diradical  state  require  extensive  incorporation  of  dynamic  electron  correlation  effects. 


Both  the  UHF-CC  and  RB-CC  levels  of  theory  have  been  shown  to  provide  accurate 
diradical  state  descriptions  in  /?-benzyne,25  but  these  methods  are  prohibitively  expensive 
for  all  but  the  smallest  enediyne  model  systems.  Semiempirical  theory  has  shown  some 
promise,  yet  it  is  difficult  to  benchmark  these  methods  without  experimental  results  on 
the  natural  products  themselves.26  Density  functional  theory  (DFT)  has  emerged  as  a 
promising  high-accuracy,  low-cost  alternative  to  high  order  dynamic  correlation  methods. 
In  the  past  five  years  broken-symmetry  (BS-UDFT)  approaches  have  emerged  as  an 
acceptable  alternative  to  dynamic  correlation  methods.  While  DFT  has  been  widely 
lauded  for  its  ability  to  reproduce  experimental  enthalpies  for  a  variety  of  small 
enediynes,  it  is  important  to  note  that  DFT’s  ability  to  accurately  model  spin  effects, 
magnetic  properties  and  other  properties  that  depend  upon  spin  is  still  in  question.27 

Because  of  the  open  shell  nature  of  both  the  diradical  and  transition  state  species, 
all  calculations  on  these  structures  were  performed  as  BS-UDFT  calculations.  Both  the 
“Guess=Mix”  and  “Stable=Opt”  keywords  were  used  to  generate  open  shell  singlet 
wavefunctions.  We  obtained  two  sets  of  optimized  geometries  for  this  system,  using  the 
B3LYP  and  the  PBE1PBE  (PBEO)  density  functionals  with  the  6-31G(d,p)  basis  set. 
Frequency  calculations  were  carried  out  upon  all  optimized  structures  to  ensure  their  true 
minimization  and  to  provide  thermochemical  corrections  to  computed  electronic  energies. 
All  thermochemistry  data  is  given  at  a  standard  state  of  one  atmosphere.  The  pre- 
cyclized  and  diradical  structures  represent  local  minima  on  the  potential  energy  surface, 
and  have  no  imaginary  frequencies.  The  optimized  transition  state  structures  have  one 
imaginary  frequency  along  the  mode  corresponding  to  the  bond  formation  in  the 
Bergman  Cyclization  reaction.  A  series  of  single  point  calculations  were  carried  out  on 


both  sets  of  optimized  structures  in  the  gas  phase  using  the  following  basis  sets:  6- 
31  lG(2d,p),  cc-pVDZ,  aug-cc-pVDZ,  cc-pVTZ  and  aug-cc-pVTZ.28'31  Tight  SCF 
convergence  criteria  were  requested  for  all  calculations.  Solvation  was  modeled 
implicitly  using  a  polarizable  continuum  model  (IEFPCM)32'34  along  with  a  molecular 
cavity  optimized  for  use  at  the  HF/6-31G*  level  of  theory  (UAHF  model).  All 
computations  were  performed  using  revisions  B.02  and  C.02  of  Gaussian  03. 35 

3.  Results  &  Discussion 

The  most  challenging  aspect  of  modeling  this  system  is  accurately  and  reliably 
describing  the  diradical  product  molecule.  This  challenge  is  two-fold:  finding  an 
acceptable  single  determinant  method  for  describing  the  multi-reference  diradical  state, 
and  converging  to  the  correct  open  shell  state.  Because  of  the  quasi-degeneracy  of  the 
two  outermost  occupied  orbitals25  it  is  easy  to  converge  to  an  incorrect  closed  shell  state. 
Thus,  we  need  a  metric  to  determine  if  our  converged  electronic  state  is  truly  the  most 
stable  one. 

3.1  Wavefunctions 

As  stated  in  the  methods  section,  BS-UDFT  was  used  to  produce  open  shell 
singlet  states  for  the  diradical  species;  unfortunately  this  method  was  unreliable.  While 
usually  effective,  there  were  instances  where  the  broken  symmetry  wavefunction  still 
collapsed  to  a  closed  shell  state,  resulting  in  poor  energies  (errors  exceeding  20  kcabmol 
').  We  ascertained  whether  a  state  properly  converged  to  an  open  shell  solution  by 
inspection  of  the  (S2)  values.  For  transition  states,  we  found  that  the  most  stable  open 
shell  states  had  a  spin  excess  of  approximately  0. 1  before  spin  annihilation.  We  also 


found  that  the  most  stable  open  shell  diradical  wavefunctions  had  a  spin  excess  of 
approximately  1  prior  to  spin  annihilation.  This  result  is  qualitatively  correct  because  the 
erroneous  closed  shell  wavefunctions  will  have  no  triplet  contamination  and  therefore 
spin  symmetry  breaking  is  not  taking  place.  The  empirical  accuracy  of  (S2)  values 
predicted  by  DFT  has  been  a  topic  of  some  debate.36  We  are  not  interested  in  the  (S2) 
values  in  and  of  themselves;  we  are  only  interested  in  them  as  a  metric  for  determining 
whether  we  have  in  fact  converged  to  an  open  shell  solution. 

In  the  cases  where  BS-UDFT  seemed  to  fail  to  produce  open  shell  states,  we 
tested  the  stability  of  the  solutions  via  the  “Stable=Opf  ’  keyword.  This  routine  tests  the 
stability  of  a  wavefunction  using  Hermitian  stability  matrices,  and  if  the  wavefunction  is 
not  stable,  re-optimizes  it  by  permuting  the  orbital  space  of  the  original  solution  as 
dictated  by  the  eigenvectors  of  the  stability  matricies.37  While  this  method  is  very 
reliable,  it  is  extremely  expensive,  and  thus  its  utility  is  limited  to  small  systems.  As  long 
as  the  (S2)  values  are  rigorously  checked,  BS-UDFT  is  a  fine  method  for  generating 
proper  states.  In  the  rare  cases  where  BS-UDFT  fails,  and  stability  calculations  are 
prohibitively  expensive,  we  found  that  a  combination  of  symmetry  breaking  and  valence 
orbital  shifting  to  be  useful.  While  this  method  is  also  unreliable,  the  calculations  are 
quick  enough  that  a  ‘guess  and  check’  approach  is  feasible,  using  the  (S2)  values  as  the 
diagnostic  tool. 

3.2  Benchmarking  Calculations 

In  previous  work24  we  made  benchmark  calculations  on  three  small  enediyne 
molecules:  cw-hex-3-ene-l,5-diyne  (1),  o-diethynylbenzene  (2)  and  cyclodec-3-ene-l,5- 


diyne  (3),  shown  in  Figure  4.  These  three  enediynes  are  particularly  important  because 
experimental  data  is  available  for  both  enthalpies  of  reaction  and  enthalpies  of  activation. 


1  2  3 

Figure  4.  Three  small  enediyne  molecules  for  which  accurate  experimental  data  is  available. 

Calculated  enthalpies  from  our  previous  work,24  from  the  work  of  Schreiner  et 
al.,iS  and  experimental  values39'41  are  presented  in  Table  1.  A  few  interesting  trends 
emerge.  First,  enthalpies  of  reaction,  calculated  using  the  B3LYP  functional,  improve 
with  relation  to  experiment  as  the  basis  set  is  expanded  from  double-zeta  to  triple-zeta. 
Second,  enthalpies  of  reaction  are  consistently  underestimated  for  double-zeta 
calculations  for  both  1  and  2.  Conversely,  enthalpies  calculated  at  the  triple-zeta  level  are 
slightly  overestimated.  Third,  for  the  two  pure  DFT  functionals,  BLYP  and  G96LYP,  the 
opposite  trend  emerges.  Enthalpies  calculated  with  the  smaller  double-zeta  basis  sets 
follow  experimental  results  more  closely,  and  triple-zeta  calculations  result  in  much 
larger  errors.  This  result  implies  that  pure  density  functionals  may  be  inferior  to  hybrid 
density  functionals  for  modeling  p-benzyne  diradicals  and  their  analogues  accurately.42 
Fourth,  enthalpies  of  activation  show  reverse  trends  from  the  enthalpies  of  reaction. 
Double-zeta  B3LYP  calculations  appear  to  be  highly  accurate  predictors  of  A//*,  while 
the  triple-zeta  calculations  overestimate  the  barrier  by  4-6  kcahmol1.  Both  pure 


functionals  consistently  underestimate  the  barrier  height,  though  expansion  of  the  basis 
set  from  double-  to  triple-zeta  reduces  the  errors  from  ~4  to  ~1  kcabmol1. 

From  our  reevaluation  of  our  previous  benchmarking  data  and  the  calculations 
performed  by  Schreiner  et  al.,  we  decided  to  employ  the  B3LYP  hybrid  functional  for 
studying  the  MTC2  model  system.  Because  the  errors  relative  to  experimental  enthalpies 
of  reaction  increase  when  the  basis  set  is  expanded  when  using  the  pure  functionals,  the 
good  agreement  between  the  double-zeta  results  and  experiment  is  likely  due  to  a 
fortuitous  cancellation  of  errors.  Also,  the  use  of  a  sum  formula  was  shown  to 


dramatically  improve  results  calculated  with  B3LYP,  but  not  for  results  calculated  with 
pure  functionals.42,43  This  formula  estimates  the  energy  of  a  pure  singlet  state  by 
assuming  that  all  spin  contamination  is  introduced  by  the  first  triplet  state.  The  energy 


contributions  from  this  triplet  state  are  then  estimated  and  removed. 

Table  1.  Enthalpies  of  activation  and  reaction  for  three  benchmark  molecules,  deviations  from  experiment 
are  italicized.  All  values  are  in  kcal'mol1. 

1  _  _ 2 _ 3 _ 


AH_ AH'-  (470  K) AH  AH*  (486  K) AH_ AH*  (298  K) 


B3LYP  6-3lG(d)fc 

3.94 

-4.6 

29.31 

1.1 

6-31G(d,p)a 

3.0 

-5.5 

29.4 

1.2 

12.0  -5.8 

29.5  4.3 

3.3 

26.0  2.8 

6-31  +  +G(d,p)fi 

5.90 

-2.6 

29.39 

1.2 

cc-pVDZ" 

4.33 

- 4.2 

28.95 

0.8 

6-311G(d,p)6 

10.34 

1.8 

32.42 

4.2 

cc-pVTZa 

10.0 

1.5 

34.2 

6.0 

18.6  0.8 

33.0  7.8 

9.6 

29.7  6.5 

cc-pVTZ" 

10.87 

2.4 

32.42 

4.2 

BLYP  6-31G(d)® 

7.25 

-1.3 

23.75 

-4.5 

6-31++G(d,p)6 

9.06 

0.6 

24.01 

- 4.2 

cc-pVDZ0 

7.59 

-0.9 

23.51 

-4.7 

6-311G(d,pf 

13.60 

5.1 

26.89 

-1.3 

cc-pVTZft 

14.09 

5.6 

26.98 

-1.2 

G96LYP  6-31G(d)* 

6.99 

- 1.5 

24.20 

-4.0 

6-31  +  +G(d,p)* 

8.31 

-0.2 

24.18 

-4.0 

cc-pVDZ& 

7.14 

- 1.4 

23.83 

-4.4 

6-311G(d,p)* 

12.98 

4.5 

26.98 

-1.2 

cc-pVTZ* 

13.26 

4.8 

26.97 

-1.2 

Experiment 

=2^ 

1.S— ~ 

28.2*0.5‘ 

17,8±l,2‘l 

25.2*9.8” . 

_ 

"  Ref. 24,  geometries  were  optimized  using  the  6-31G(d,p)  basis  set 
b  Ref.  38,  geometries  were  optimized  using  the  6-3 1  G(d)  basis  set 
‘  Ref. 39 
‘'Ref.40 

f  Ref.  4I,  using  the  relationship  between  Ea  and  H  for  an  unimolecular  reaction  (A//;  =  Ea-  RT) 
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3.3  Functional  Effects 

All  calculations  used  two  hybrid  density  functionals;  the  three  parameter  B3LYP 
functional  and  the  one  parameter  PBEO  functional.44  One  of  the  advantages  of  the  PBEO 
functional  is  its  ability  to  provide  results  superior  to  empirically  derived  density 
functionals  for  a  variety  of  chemical  properties  (magnetic,  vibrational  and  electronic).45 
This  seems  to  be  an  obvious  advantage  over  B3LYP,  which  has  been  shown  to  poorly 
predict  magnetic  and  electronic  properties  for  p- benzynef 7 

Geometry  optimizations  performed  with  the  functionals  produced  two  sets  of 
three  geometries  (Figure  5  and  Supplemental  Information).  While  the  two  methods 
generally  reproduced  each  other’s  bond  lengths  within  a  hundredth  of  an  angstrom,  the 
non-bonded  intemuclear  distances  were  significantly  different.  Figure  5  depicts  the 
important  non-bonded  intemuclear  distances  of  the  chemically  active  region  for  all  six 
optimized  geometries.  In  all  cases,  the  PBEO  functional  predicts  shorter  distances,  and 
tighter  angle  bends  within  the  ring.  These  differences  are  most  significant  in  the  pre- 
cyclized  structure,  where  the  PBEO  structure  has  the  key  carbon  distances  shortened  by 
0.024  and  0.026A,  respectively.  This  contraction  effect  is  reduced  in  the  transition  state 
as  the  benzyne  ring  is  partially  formed,  and  the  PBEO  structure  has  the  same  carbon 
distances  shortened  by  0.015  and  0.01 2A  relative  to  the  B3LYP  structure.  Once  the 
Bergman  cyclization  is  completed  and  the  p-benzyne  ring  is  fully  formed,  the  two 
different  DFT  structures  now  have  carbon  distances  that  differ  by  less  than  a  hundredth  of 


an  angstrom. 


Figure  5.  Critical  intemuclear  distances  for  the  three  optimized  MTC2  geometries.  Non-bonded  distances 
are  displayed  with  a  dotted  line,  the  semi-bond  of  the  transition  state  is  displayed  with  a  dashed  line. 
Geometries  were  optimized  with  both  the  B3LYP  and  PBEO  (in  parenthesis)  functionals  using  the  6- 
3  lG(d,p)  basis  set.  All  values  are  given  in  angstroms. 


These  subtle  differences  in  geometry  appear  to  manifest  themselves  as  significant 
differences  to  computed  energy.  While  B3LYP/6-31G(d,p)  predicts  A Ee  and  A Ee*  values 
of -2.34  and  22.52  kcaEmol'1,  similar  PBEO  calculations  give  dramatically  different 
results;  -1 1.93  and  19.62  kcaBmol'1.  Increasing  the  basis  set  size  for  subsequent  single 
point  calculations  produces  no  significant  changes  to  these  results  (as  explained  later). 
We  believe  that  the  large  negative  A Ee  predicted  by  the  PBEO  calculation  stems,  at  least 
partially,  from  significant  destabilization  in  the  PBEO  optimized  pre-cyclized  MTC2 
structure.  Figure  6  illustrates  this  effect  by  depicting  the  B3LYP  and  PBEO  reaction 
coordinates.  Because  there  were  no  significant  geometric  distortions  between  the  two 
optimized  diradical  geometries,  the  two  reaction  coordinates  were  ‘zeroed’  energetically 
with  respect  to  products.  Once  the  product  energies  are  normalized,  the  difference 
between  the  B3LYP  and  PBEO  results  is  apparent.  The  PBEO  reactant  and  transition- 
state  structures  are  destabilized  relative  to  those  obtained  with  B3LYP. 


Figure  6.  Reaction  coordinate  of  the  MTC2  Bergman  cyclization.  Both  B3LYP  (black)  and  PBEO  (red) 
reaction  coordinates  are  displayed  with  both  product  species  normalized  to  each  other  with  respect  to 
absolute  energy.  Relative  electronic  energies  were  calculated  with  the  6-31G(d,p)  basis  and  are  given  in 
kcalwiol'1. 


3.4  Basis  Set  Effects 

We  used  six  different  basis  sets  in  this  study  in  an  effort  to  study  which  types  of 
basis  functions  would  be  computationally  efficient  for  larger  enediyne  systems.  While  all 
geometry  optimizations  used  the  well  established  6-31G(d,p)  basis  set,  a  variety  of  other 
basis  sets  were  tested  using  a  series  of  single  point  calculations.  Specifically,  we  tested 
the  differences  between  similarly  sized  Pople  and  Dunning  style  basis  sets,  the 
importance  of  expanding  from  a  double-  to  a  triple-zeta  basis  set  and  what  impact  the 
augmentation  of  diffuse  functions  may  have.  Enthalpies,  free  energies  and  electronic 
energies  of  reaction  calculated  with  these  six  basis  sets  are  displayed  in  Table  2. 


As  was  the  case  with  the  small  enediynes  (1  and  2)  used  for  previous 
benchmarking  studies,24  an  expansion  of  the  basis  set  from  double-  to  triple-zeta 
produced  significant  changes  in  the  calculated  energies  of  reaction.  Predicted  enthalpies 
of  cyclization  became  ~5  kcabmof1  more  exothermic  for  each  of  the  three  pairs  of 
corresponding  double-/triple-zeta  basis  sets.  This  trend  was  also  present  in  the  PBEO 
data,  as  triple-zeta  calculations  were  consistently  ~4.5  kcahmol'1  more  exothermic  than 
their  double-zeta  counter  parts.  These  variations  in  enthalpies  are  slightly  smaller  than, 
but  consistent  with  the  variations  in  calculated  enthalpies  for  1  and  2,  which  were  ~7 
kcabmol'1  more  exothermic  for  triple-zeta  calculations,  and  tended  to  agree  much  better 
with  experiment.24  Enthalpies  of  reaction  appeared  to  be  less  sensitive  to  this  type  of 
basis  set  expansion.  While  quantities  still  became  more  exothermic,  the  barrier  heights  to 
MTC2  cyclization  only  increased  by  ~3  -  3.5  and  ~2.5  -  3  kcabmol"1  for  B3LYP  and 
PBEO  respectively.  This  result  is,  again,  very  consistent  with  the  results  from  the  smaller 
benchmarking  enediynes  (1,2  and  3),  as  predicted  barrier  heights  in  these  small 
molecules  increased  by  ~2.5  -  3.5  kcabmol'1  when  the  basis  set  was  expanded  from 
double-  to  triple-zeta.24  The  inclusion  of  triple-zeta  single  point  calculations  is  essential 
for  any  computational  schemes  devised  for  this  type  of  system. 

Augmenting  a  basis  set  with  diffuse  functions  had  a  negligible  effect  upon 
calculated  energies  for  MTC2.  In  all  four  pairs  of  augmented/non-augmented 
calculations  (two  for  B3LYP  and  two  for  PBEO)  the  enthalpies  of  reaction  became  more 
endothermic  by  less  than  0.25  kcabmol'1.  Similarly,  changes  to  enthalpy  barriers  were 
also  very  small  and  endothermic,  the  largest  change  was  less  than  0.35  kcahmol'1. 

Diffuse  functions  are  an  unnecessary  expense. 


Differences  between  computations  performed  with  the  Pople  and  Dunning  style 
basis  sets  are  small  as  well,  with  the  larger  differences  occurring  at  the  triple-zeta  basis 
level.  For  enthalpies  of  reaction,  Dunning  basis  sets  consistently  produce  predictions  that 
are  more  exothermic  than  their  Pople  counterparts,  with  differences  less  than  0.35 
kcabmol'1  at  the  double-zeta  and  less  than  0.90  kcabmol'1  at  the  triple-zeta  levels  of 
theory.  Changes  to  calculated  barrier  heights  are  small  as  well.  Barrier  heights  at  the 
double-zeta  level  increase  by  less  than  0.30  kcabmol'1  and  by  less  than  0.50  kcabmol'1  at 
the  triple-zeta  level.  Again,  these  changes  were  consistent  with  the  data  from  molecule  1, 
where  differences  between  calculations  performed  using  the  Pople  and  Dunning  basis 
sets  were  quite  minor.  Also,  the  more  exothermic  Dunning  style  basis  set  calculations  are 
in  slightly  better  agreement  with  available  experimental  data  for  enediyne  1. 

Furthermore,  the  Dunning  basis  sets  are  both  smaller  than  their  Pople  counterparts  and  do 
not  employ  a  segmented  contraction  scheme.31  These  two  features  make  the  Dunning 
basis  sets  more  computationally  efficient,  and  thus  less  expensive  while  producing 
slightly  more  accurate  predictions  than  the  Pople  basis  sets.  Therefore,  it  would  seem 
slightly  advantageous  to  employ  Dunning  style  basis  sets  for  computations  upon 
enediyne  systems. 


Table  2.  Electronic  energies  of  reaction  and  gas-phase  DFT  thermochemistry  data  as  a  function  of  basis 
set.  All  values  presented  in  kcahmoT . 


B3LYP/X//6-31G(d,p) 


6-31G(d,p) 

cc-pVDZ 

aug-cc-pVDZ 

6-311G(2d,p) 

cc-pVTZ 

aug-cc-pVTZ 

A  Ec 

-2.34 

-2.00 

-1.76 

2.67 

3.57 

3.52 

A£e* 

22.52 

22.62 

22.58 

25.72 

26.23 

26.10 

A  H 

-2.50 

-2.16 

-1.92 

2.51 

3.41 

3.36 

AH1 

20.60 

20.70 

20.65 

23.80 

24.30 

24.18 

AG 

-1.03 

-0.69 

-0.45 

3.98 

4.88 

4.83 

AG* 

21.55 

21.65 

21.61 

24.75 

25.26 

25.13 

PBE0/X//6-31G(d,p) 

6-31G(d,p) 

cc-pVDZ 

aug-cc-pVDZ 

6-311G(2d,p) 

cc-pVTZ 

aug-cc-pVTZ 

A£c 

-11.93 

-11.70 

-11.78 

-8.01 

-7.36 

-7.38 

A  fe* 

19.62 

19.90 

19.57 

22.12 

22.43 

22.30 

AH 

-11.93 

-11.70 

-11.78 

-8.01 

-7.36 

-7.38 

AH* 

17.96 

18.24 

17.91 

20.46 

20.77 

20.65 

AG 

-10.43 

-10.20 

-10.28 

-6.52 

-5.86 

-5.88 

AG* 

18.93 

19.21 

18.88 

21.43 

21.74 

21.61 

3.5  Solvation  Effects 

A  series  of  B3LYP/6-31G(d,p)  geometry  optimizations  were  performed  in 
conjunction  within  a  dielectric  continuum  solvation  model.  Optimized  gas-phase 
structures  were  used  as  the  input  geometry.  Converged  aqueous-phase  geometries  were 
obtained  for  both  the  pre-cyclized  and  diradical  species.  Unfortunately  the  transition 
state  geometry  suffered  from  SCF  convergence  failures  with  the  open  shell  wavefunction. 
Re-optimization  of  the  pre-cyclized  geometry  produced  one  significant  change  to  the 
chemically  active  ring  structure,  as  the  non-bonded  carbon-carbon  intemuclear  distance 
lengthened  by  0.024A.  All  bonded  intemuclear  distances  in  the  pre-cyclized  region 
remained  relatively  constant  (A r  <  0.005A). 

In  addition  to  the  aqueous  geometry  optimization  calculations,  a  series  of  single 
point  aqueous  calculations  were  performed  on  optimized  gas-phase  geometries  using  the 
Gaussian  SCRF  option,  SCFVAC.  The  aqueous-phase  geometries  yielded  electronic 
energies  of  reaction  of  -1 .23  and  3.79  kcal*mol 1  for  the  6-31G(d,p)  and  6-3 1  lG(2d,p) 
basis  sets  respectively,  whereas  the  SCFVAC  calculations  yielded  electronic  energies  of 


reaction  of  -2.87  and  2.17  kcahmol'1  for  the  two  basis  sets,  a  change  of -1.6  kcabmol'1. 
The  aqueous-phase  free  energies  of  reaction  and  activation  predicted  by  SCFVAC  for 
MTC2  are  2.08  and  24.03  kcalwiol'1  respectively,  calculated  at  the  triple-zeta  level  of 
theory.  These  results  suggest  that  the  computationally  expensive,  and  labor  intensive 
process  of  geometry  re-optimization  in  PCM  may  be  unnecessary  to  achieve  accurate 
aqueous  phase  reaction  data. 

4.  Conclusion 

We  have  established  an  effective  computational  methodology  for  the  study  of 
Bergman  cyclization  in  small  and  moderate  sized  enediyne  molecules.  The  shortcomings 
of  the  PBEO  results  underscore  the  need  for  continued  development  of  density  functionals 
specifically  designed  to  study  biomolecules,  particularly  those  with  radical  and  diradical 
chemistry.  While  the  venerable  B3LYP  functional  is  the  most  cost  effective  way  to  study 
the  energetics  of  the  Bergman  cyclization,  numerous  questions  about  B3LYP’s  ability  to 
accurately  model  spin,  and  its  associated  magnetic  properties  still  persist.  Also, 
accurately  modeling  the  two  open  shell  states  (transition  state  and  diradical)  is 
challenging.  BS-UDFT  does  a  good  job  of  generating  open  shell  singlet  states,  however 
it  does  fail  at  times,  so  spin  excess  values  must  be  rigorously  checked  for  wavefunction 
collapse.  Stability  calculations  provide  dependable  alternatives  to  BS-UDFT,  these 
calculations  however  are  very  expensive  for  any  system  much  larger  than  MTC2  and 
prohibitively  expensive  for  esperamicin  A|.  A  cheaper,  yet  effective,  alternative  is 
needed.  Repeating  a  calculation  with  combinations  of  symmetry  breaking  and  valence 
orbital  shifting,  combined  with  checking  (S2)  values,  is  the  most  cost  effective  procedure 


at  present.  We  will  employ  our  methodology  in  the  future  study  of  larger  enediyne  model 
systems,  and  the  eventual  study  of  esperamicin  A|. 
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Abstract 

State-of-the-art  molecular  dynamics  simulations  predict  a  common  conformation 
shared  between  8-mer  and  cyclic  9-mer  peptides  that  inhibit  human  breast  cancer.  The 
predicted  common  structure  contains  a  conserved  reverse  turn.  The  simulations  were  used 
to  identify  the  smaller  peptide  analogs  TOVNO,  TPVNP,  TOVN,  and  TPVN  with  the 
same  conserved  reverse  turn.  These  analogs  have  been  synthesized  and  shown  to  inhibit 
estrogen-dependent  cell  growth  in  a  mouse  uterine  growth  assay,  a  test  that  shows 
reliable  correlation  with  the  inhibition  of  human  breast  cancer.  This  is  the  first  time  that 
Replica  Exchange  Molecular  Dynamics  has  been  used  to  predict  a  new  lead  drug 


compound. 


Introduction 

Breast  cancer  is  the  most  common  cancer  diagnosed  in  women  and  is  the  second 
leading  cause  of  cancer  death  among  women,  closely  following  lung  cancer.  In  2006,  the 
American  Cancer  Society  estimated  that  212,920  women  in  the  United  States  will  be 
diagnosed  with  invasive  breast  cancer  and  predicted  41,430  deaths  (/).  Tamoxifen  is  the 
most  widely  used  drug  for  the  treatment  of  estrogen  receptor-positive  breast  cancer, 
acting  through  competition  with  estrogen  for  binding  to  the  estrogen  receptor  (ER). 

However,  approximately  one  third  to  one  sixth  of  ER-positive  breast  cancers  are 
intrinsically  resistant  to  tamoxifen,  and  many  more  acquire  resistance  to  this  drug  during 
treatment  (2).  Additionally,  tamoxifen  stimulates  uterine  growth,  which  can  lead  to 
uterine  cancer  in  a  small  percentage  of  women  taking  this  drug  (2).  Consequently,  there  is 
ongoing  need  for  breast  cancer  drugs  with  greater  efficacy  and  fewer  side  effects. 

Alpha-fetoprotein  (AFP)  is  an  embryo  specific  serum  alpha-globulin  glycoprotein 
that  is  synthesized  primarily  by  the  fetal  liver  and  circulates  through  the  serum  of 
pregnant  women  (3).  AFP  has  been  reported  to  bind  and  transport  ligands,  including  fatty 
acids,  steroids,  heavy  metal  ions,  phytoestrogens,  drugs  and  some  toxins  (4).  AFP  is  a 
growth-regulating  hormone  with  the  capacity  to  stimulate  or  inhibit  growth  (J).  From 
fertilization  through  birth,  AFP  holds  an  important  physiological  role  as  a  developmental 
promoter  for  the  fetus.  More  recent  reports  have  shown  that  AFP  has  antiestrogenic 
activity  and  can  inhibit  the  growth  of  estrogen  dependent  cancer  (2,  6-8).  These  data, 
combined  with  epidemiological  data  showing  that  elevated  levels  of  AFP  are  associated 


with  reduced  lifetime  risk  of  breast  cancer  (7),  have  led  to  the  suggestion  that  AFP  or 
analogs  thereof  may  be  a  useful  agent  for  chemoprevention,  as  well  as  for  the  treatment, 
of  breast  cancer  (7,  9,  10).  Festin  et  al.  (11),  Eisele  et  al.  (12,  13),  and  Mesfin  et  al.  (14) 
parsed  the  70,000  MW  AFP  into  to  a  series  of  smaller  peptides  that  retained  the  same 
anti-estrogenic  and  anti-breast  cancer  activities  of  the  original  protein.  Their  work 
resulted  in  the  smallest  known  active  analog,  an  8-mer  with  sequence  EMTPVNPG 
(AFPep)  (14),  which  consists  of  amino  acids  472-479  from  the  human  AFP  sequence. 
Amino  acid  substitution  studies  revealed  the  importance  of  specific  residues  essential  for 
activity  (15).  All  efforts  since  then  to  create  an  active  peptide  consisting  of  fewer  than 
eight  residues  has  resulted  in  the  loss  of  anti-estrogenic  activity  (14,  15). 

A  receptor  for  AFP  on  cancer  cell  membranes  has  been  reported  (16),  and  binding 
of  AFP  (or  AFPep)  to  this  receptor  signals  the  cell  to  inhibit  its  own  growth.  The 
steric/electronic  features  of  the  receptor  site  that  permits  AFP  binding  are  largely 
unknown,  making  rational  development  of  lead  compounds  difficult.  We  present  here  a 
novel  strategy  for  developing  new  lead  compounds,  a  strategy  that  uses  molecular 
dynamics  to  explore  the  allowed  conformational  space  of  potentially  active  analogs  in 
solution.  Rational  drug  design  using  molecular  dynamics  in  this  manner  involves 
understanding  the  conformational  space  occupied  by  the  active  compounds  followed  by 
the  creation  of  different  compounds  that  sample  the  same  space.  We  have  used  Replica 
Exchange  (17,  18)  Molecular  Dynamics  (REMD)  techniques  (19)  to  explore  the 
conformational  dynamics  of  several  AFPep  analogs.  Our  REMD  results  reveal  that  the 
peptide's  critical  region  for  activity  is  a  four  amino  acid  sequence  that  adopts  a  Type  I  |3- 


tum  conformation.  We  have  run  REMD  simulations  on  several  different  four-  and  five- 


amino  acid  peptides,  synthesized  those  that  appeared  promising,  along  with  controls,  and 
tested  them  for  activity  using  an  immature  mouse  uterine  growth  assay  (2).  Results  from 
the  REMD  simulations  and  the  experimental  activity  studies  are  presented  in  this  paper, 
and  show  that  peptides  as  short  as  four  amino  acid  residues  retain  biological  activity. 

The  REMD  technique  has  been  successfully  used  to  obtain  energy  landscapes  for 
the  TRP  cage  and  for  the  C-terminal  (3-hairpin  of  protein  G  (20-22),  to  find  the  global 
minimum  for  chignolin  (23),  and  to  explore  unfolding  of  alpha  helical  peptides  as  a 
function  of  pressure  and  temperature  (24).  REMD  has  also  been  used  to  explore 
intrafacial  folding  and  membrane  insertion  of  designed  peptides  (25),  and  protein  folding 
pathways  (26,  27).  We  demonstrate  for  the  first  time  that  REMD,  besides  being  an 
excellent  technique  for  probing  peptide  and  protein  structure,  can  be  used  for  drug 
design. 

Results 

Figure  1  shows  the  conservation  of  conformational  space  of  the  five  active  8-mer 
and  cyclic  9-mer  AFP-derived  peptides  previously  shown  (15)  to  have  anti-estrogenic 
activity.  This  figure  shows  the  overlay  of  the  most  representative  peptide  geometries 
obtained  from  the  conformational  family  that  displays  the  common  reverse  turn  motif  in 
each  REMD  simulation.  Four  amino  acids,  TPVN,  are  conserved  in  the  conformational 
space  of  the  active  8-mers  and  cyclic  9-mers.  The  TPVN  sequence  forms  a  reverse  turn 
within  the  longer  peptides,  a  structure  that  is  conserved  across  all  five  REMD 
simulations.  In  a  Type  I  reverse  p-tum,  four  amino  acids  form  a  turn  structure,  which  is 
defined  by  the  phi  (<)))  and  psi  (i|>)  angles  of  the  second  and  third  amino  acids.  An  ideal 


Type  I  p-tum  has  <J)Ap  values  of  -60  and  -30  degrees  for  the  second  amino  acid,  and  <j>A|> 
values  of  -90  and  0  degrees  for  the  third  amino  acid  in  the  four  amino  acid  turn  sequence. 
As  displayed  in  Table  1,  the  cyclic-[EMTPVNPGQ]  and  EMTPVNPG  peptides  have 
average  4>Pro,  t|>pro,  and  <t>Vai,  H’vai  values  that  fall  within  14  degrees  of  a  Type  I  turn.  The 
other  three  simulations  of  the  8-mer  and  9-mer  peptides  have  similar  values  (Figure  1). 

We  ran  REMD  simulations  for  the  TPVN,  TPVNP,  and  PVNP  analogs  to  explore 
the  conformational  space  sampled  by  these  four  and  five  amino  acid  peptides.  The  TPVN 
and  TPVNP  peptides  form  the  same  reverse  turn  seen  in  the  larger,  active  peptides.  As 
displayed  in  Table  1,  the  TPVNP  and  TPVN  analogs  have  (j)^,  values  of 
approximately  -67  and  -19  degrees,  while  the  <t>Va|,  ipVai  values  are  approximately  -91  and  - 
12  degrees.  In  contrast,  the  average  (J)^,  and  c(>va|,  H’vai  values  for  the  three  different 
conformations  of  the  PVNP  peptide  reveal  that  none  of  the  three  structures  is  a  turn 
(Table  1).  The  structure  that  is  sampled  for  41%  of  the  REMD  simulations  has  (j)^,  ijVo 
angles  of  -75  and  -14  degrees,  and  <|>Val,  \lvai  angles  of-109  and  120  degrees.  The 
structure  sampled  for  37%  of  the  REMD  simulation  has  (jjproAlW  of-108  and  1  degrees, 
while  the  structure  sampled  for  27%  of  the  time  has  ^prcAppro  of —107  and  46  degrees.  All 
three  of  these  structures  have  •Ka/Vvai  of -126  and  123  degrees.  Consideration  of  the 
average  valine  angles  alone  show  that  the  PVNP  peptide  does  not  form  a  turn  structure. 

The  representative  dynamics  of  three  analogs  containing  the  conserved  TPVN 
sequence  can  be  visualized  in  Figure  2.  The  top  graphs  display  the  distance  between  the 
C„  atoms  of  the  threonine  and  asparagine  residues,  serving  as  a  definitive  diagnostic  for  a 
|3-tum.  Twenty-five  percent  of  (3-tums  have  no  intratum  hydrogen  bond,  so  an  alternative 
definition  of  a  p-tum  is  that  the  distance  between  the  C„  carbon  atoms  of  amino  acid 


residues  one  and  four  in  the  tetrapeptide  sequence  is  less  than  7  A  (2S).  The  TPVN 
tetrapeptide  has  a  Ca(X)  -  CU(N)  distance  less  than  7  A  for  64%  of  the  simulation,  indicating 
a  p-turn.  The  TPVNP  pentapeptide  adopts  the  turn  structure  for  75%  of  the  simulation, 
and  the  cyclic-[EMTPVNPGQ]  peptide  is  locked  into  the  turn  structure  for  99%  of  the 
simulation.  The  bottom  graphs  in  Figure  2  show  the  corresponding  three-dimensional 
plots  of  (J>  versus  \ji  versus  simulation  time.  These  plots  reveal  the  dynamics  of  the  4>Ap 
values  for  proline  (red)  and  valine  (green)  amino  acids  throughout  the  simulation  for 
these  three  peptides.  These  plots  confirm  that  these  three  peptides  adopt  a  p-turn 
conformation  over  the  course  of  the  simulation. 

Several  4-mer  and  5-mer  peptide  analogs  containing  the  TPVN  sequence,  or  a 
similar  sequence  with  hydroxyproline  (O)  substituted  for  proline,  were  synthesized, 
tested,  and  compared  to  the  original  8-mer  peptides  for  biological  activity.  As  shown  in 
Table  2,  biological  activity,  as  defined  by  inhibition  of  estrogen-stimulated  growth  of 
immature  mouse  uterus,  was  retained  in  the  TPVN  and  TOVN  4-mers  and  even  more  so 
in  the  TPVNP  and  TOVNO  5-mers. 

The  OVNO  and  PVNP  analogs,  which  were  thought  to  represent  the 
pharmacophore  region  of  AFPep  (/5),  did  not  show  significant  activity.  Similarly,  five 
amino  acid  peptides  containing  the  amino  or  carboxyl  end  of  AFPep  did  not  have 
biological  activity  (bottom  of  Table  2).  The  7-mer,  EMTPVNP,  was  slightly  less  active 
than  AFPep  and  the  smaller  TPVNP. 

The  figures  reveal  why  the  TPVNP  and  TPVN  analogs  are  active.  All  active 
peptides  have  a  conserved  reverse  P-tum  motif.  These  P-tums  are  formed  by  the  TPVN 
sequence,  with  a  hydrogen  bond  formed  between  the  carbonyl  oxygen  of  the  first  amino 


acid  and  the  amide  hydrogen  of  the  fourth  amino  acid.  The  conservation  of  proline  in  the 
second  position  favors  the  formation  of  a  reverse  turn.  This  proline  is  conserved  in 
human,  gorilla,  chimpanzee,  horse,  rat,  and  mouse  AFP  sequences  (75). 


Figure  1.  Overlay  of  the  cyclic-[EKTPVNPGN]  (red),  cyclic-[EKTPVNPGQ]  (blue), 
cyclic-[EMTPVNPGQ]  (orange),  and  the  EMTPVNPG  (green)  and  EMTPTNPG 
(purple)  peptides  from  REMD  simulations.  Each  structure  represents  the  p-tum  motif 
sampled  during  the  dynamics. 


Figure  2.  The  top  graphs  depict  the  distances  between  the  conserved  threonine  and 
asparagine  C„  atoms  as  a  function  of  simulation  time  for  the  TPVN  (left),  TPVNP 
(middle),  and  cyclic-[EMTPVNPGQ]  peptides.  The  bottom  graphs  depict  their 
corresponding  phi  (x-axis)  and  psi  (y-axis)  angles  as  a  function  of  simulation  time  for  the 
conserved  proline  (red)  and  valine  (green)  amino  acids. 


Table  1.  Average  4>Aj)  angle  values  for  the  second  and  third  amino  acids  for  five  different 
sequences.  The  four  residues  in  boldface  define  amino  acids  one  through  four  for  each 
peptide.  The  <()/\^  angle  values  for  the  second  and  third  amino  acids  of  a  four  amino  acid 
sequence  are  diagnostic  for  a  p-tum  structure. 


♦2 

*2 

♦3 

Type  I  p-tum 

-60 

-30 

-90 

0 

Cyclic-[EMTPVNPGQ] 

-68 

-16 

-96 

-9 

EMTPVNPG3 

-68 

-19 

-92 

-11 

TPVNP 

-66 

-19 

-94 

-12 

TPVN 

-68 

-19 

-88 

-11 

PVNP  41% 

-75 

-14 

-126 

123 

32% 

-108 

1 

-126 

123 

27% 

-107 

46 

-126 

123 

a.  The  sequence  of  the  AFPep  peptide. 


Table  2.  Effect  of  AFP-derived  peptides  on  estrogen-stimulated  growth  of  immature 
mouse  uterus.  Estradiol  and  peptide  injection  into  immature  mice  was  carried  out  as 
described  in  Material  and  Methods.  Peptide  was  injected  intraperitoneally  at  a  dose  of 
1  pg  per  mouse. 


Sequence 

%  Inhibition  ±  SE 

EMTPVNPG 

34  ±3“ 

EMTOVNOG 

32  ±4“ 

EMTPVNP 

19  ±  2 

TOVNO 

31  ±4a 

TPVNP 

26  ±  r 

TPVN 

24  ±  5a 

TOVN 

22  ±  r 

OVNO 

6  ±  5 

PVNP 

5  ±  3 

PGVGQ 

0 

EMTPV 

0 

EMTOV 

0 

a.  p  <  0.05  when  compared  to  group  stimulated  with  E2  alone.  Wilcoxon  Rank-Sum  Test. 


Discussion 


We  have  used  REMD  simulations  to  sample  the  conformational  space  of  8-mer  and  9- 
mer  AFP-derived  peptides  that  have  anti-estrogenic  and  anti-breast  cancer  activity.  We 
discovered  that  an  identical  four  amino  acid  sequence  had  minimal  conformational 
flexibility,  suggesting  that  this  region  is  essential  for  the  biological  activities  of  these 
peptides.  The  TPVN,  TOVN,  TPVNP,  and  TOVNO  sequences  were  subsequently 
synthesized  and  were  found  to  be  biologically  active.  This  is  a  novel  finding  because 
Mesfrn  et  al .(//)  and  DeFreest  et  al.(/5)  had  concluded  that  the  8-mer,  EMTPVNPG,  was 
the  minimum  number  of  residues  that  an  AFP-derived  peptide  can  have  and  still  retain 
significant  anti-estrogenic  activity.  This  conclusion  was  based  on  the  findings  that  the  7- 
mers,  MTPVNPG  and  EMTPVNP,  had  relatively  less  biological  activity  than  the  8-mers. 
These  investigators  concluded  that  the  8-mer  peptide  assumed  a  pseudo-cyclic 
conformation,  that  the  middle  of  the  peptide  contained  its  pharmacophore  region  (the 
PVNP  sequence),  and  that  the  exterior  residues  were  essential  for  holding  the  peptide  in 
its  active  conformation.  The  7-mers  lost  activity,  which  was  assumed  to  result  from  the 
loss  of  hydrogen  bonding  between  the  first  and  eighth  amino  acid  residues,  suggesting 
that  full  activity  must  require  at  least  eight  amino  acid  residues.  Indeed  the  5-mer, 
EMTPV,  was  synthesized  and  found  to  have  no  significant  activity,  which  supported  their 
conclusion  that  the  8-mer  sequence  was  the  minimal  sequence  for  biological  activity  (14). 
Furthermore,  guided  by  this  hydrogen  bond  hypothesis,  cyclic  9-mers  were  also 
synthesized  and  shown  to  possess  significant  biological  activity  (29).  We  tested  this  8- 
mer  hypothesis  by  examining  the  percentage  of  hydrogen  bond  formation  between  the 
first  and  eighth  amino  acids  for  each  simulation  containing  seven  or  eight  amino  acid 


residues.  For  all  8-mer  and  7-mer  simulations,  hydrogen  bond  formation  between  the  first 
and  the  last  amino  acids  is  observed  for  less  than  1%  of  the  simulation  if  the  distance 
definition  for  a  hydrogen  bond  is  set  at  2.3  A.  This  result  is  contrary  to  the  original 
hypothesis. 

The  REMD  studies  reported  herein  lead  to  a  substantially  different  conclusion 
regarding  the  requirement  for  peptide  activity.  These  studies  indicate  that  the  key  region 
is  the  TPVN  sequence,  since  this  structure  retains  the  turn  conformation  common  in 
every  active  peptide;  the  PVNP  peptide  does  not  sample  a  reverse  turn  structure  (Table  1) 
and  shows  relatively  low  activity  (Table  2).  The  above  discovery  has  significant 
consequences.  First,  a  4-mer  or  a  5-mer  peptide  that  retains  biological  activity  is  less 
expensive  to  synthesize  than  an  8-mer  or  9-mer  peptide,  and  these  smaller  analogs  are 
more  drug-like.  Second,  knowing  the  conformational  space  occupied  by  the  4-mer 
provides  insight  into  the  topology  of  the  unknown  receptor  site  for  these  peptides.  Based 
on  this  study,  the  receptor  site  topology  for  the  AFP  analogue  peptides  is  predicted  to  be 
a  mirror  image  of  a  reverse  (3-tum.  Previous  substitutions  in  the  8-mer  sequences  reveal 
that  when  threonine,  leucine,  or  isoleucine  is  substituted  for  valine,  biological  activity  is 
retained,  while  substitution  of  valine  with  D-valine  or  alanine  results  in  loss  of  activity. 
This  implies  that  the  topology  of  the  receptor  is  stereospecific,  and  branched  amino  acids 
are  essential  for  creation  of  hydrophobic  forces  that  bind  the  receptor  to  the  peptide. 
Finally,  it  leads  to  a  different  conceptual  approach  for  stabilization  of  these  peptides  and 
the  development  of  peptidomimetics.  Peptidomimetics  can  be  designed  based  on  the 
active  4-mer  and  5-mer  peptides,  with  REMD  used  to  ensure  that  the  steric  and  electronic 
nature  of  the  peptides  is  retained.  Developing  a  peptidomimetic  may  not  be  necessary,  for 


as  long  as  the  peptides  are  bioavailable  they  have  the  advantage  of  a  lower  probability  of 
side  effects  compared  to  peptide  mimics.  The  AFP  8-mer  and  cyclic  9-mer  peptides  have 
already  been  shown  to  be  non-toxic  and  bioavailable  in  mice  (2).  Cancer  xenograft  assays 
in  mice  involved  the  administration  of  8-mer  and  cyclic  9-mer  peptides  twice  a  day  for  30 
days,  during  which  there  was  no  change  in  mouse  body  weight,  cage  activity,  fur  texture, 
or  body  temperature.  In  addition  there  were  no  changes  in  size  or  appearance  of  major 
organs  relative  to  the  control  group.  The  uterine  growth  studies  revealed  that  these 
peptides,  unlike  tamoxifen,  did  not  stimulate  murine  uterine  growth;  indeed  they  inhibit 
the  uterine  stimulated  growth  induced  by  tamoxifen  (2,  8).  Thus,  the  peptides  derived 
from  AFP  represent  a  new  class  of  potential  breast  cancer  drugs,  which  are  active  through 
a  new,  yet  to  be  discovered,  receptor. 

Because  there  is  excellent  correlation  between  the  uterine  growth  assay  and  the 
human  breast  cancer  xenograft  assay  with  regard  to  AFPep  peptide  inhibition  of  estrogen- 
stimulated  growth  (2,  14),  the  TOVNO,  TPVNP,  TOVN,  and  TPVN  analogs  that  are  all 
active  in  the  uterine  growth  assay  are  predicted  to  inhibit  human  breast  cancer  growth. 

We  have  begun  the  xenograft  assays  on  these  analogs  to  evaluate  this  prediction. 

Conclusion 

We  have  applied  and  demonstrated  for  the  first  time  that  REMD  simulations  can 
be  used  as  a  novel  drug  design  tool.  We  have  shown  that  REMD  predicts  a  common 
conformation  that  is  shared  between  the  active  linear  8-mer  and  cyclic  9-mer  peptides. 
The  predicted  common  conformation  is  a  conserved  reverse  p-tum,  and  the  smaller 
peptide  analogs  TOVNO,  TPVNP,  TOVN,  and  TPVN  also  contain  the  same  conserved 


reverse  turn.  These  analogs  are  shown  to  inhibit  estrogen-dependent  cell  growth  in  a 
mouse  uterine  growth  assay,  through  interaction  with  a  yet  to  be  discovered  key  receptor, 
and  are  predicted  to  inhibit  human  breast  cancer.  The  5-mer  and  4-mer  peptides  are  new 
discoveries  that  may  lead  to  promising  new  anti-breast  cancer  drugs  (50). 
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SUPPORTING  ONLINE  MATERIAL 


Methods 

We  used  the  AMBER  8  molecular  dynamics  program  package  (19,  31)  with  a 
new  force  field  that  has  been  created  specifically  to  improve  the  representation  of  peptide 
structure  (32).  Simulations  were  run  in  implicit  solvent,  using  the  Generalized  Bom  (GB) 
model  (33)  implemented  in  AMBER  (34)  with  the  default  radii  under  the  multi sander 
framework  (34).  It  has  been  demonstrated  that  the  use  of  continuum  solvent  models  in 
REMD  can  lead  to  over  stabilization  of  ion  pairs  that  affect  the  secondary  structure  (35, 
36),  but  since  we  do  not  have  salt  bridges  in  our  peptide  structure  we  avoid  this  potential 


problem.  Adequate  sampling  of  conformational  space  was  insured  through  the  use  of  the 
Replica  Exchange  algorithm  (17,  18).  REMD  was  developed  as  a  means  to  overcome 
local  potential  energy  barriers.  Compiled  under  AMBER  8,  one-dimensional  REMD 
explores  a  generalized  canonical  ensemble  of  N  non-interacting  replicas  that  undergo 
simulation  separately  but  concomitantly  at  exponentially  related  temperatures,  with 
exchanges  between  replicas  occurring  at  a  specified  time  interval.  The  consequence  of 
this  exchange  is  that  entrapment  in  local  potential  energy  wells  is  avoided.  Those 
replicas,  which  do  become  trapped  within  a  local  well  at  one  temperature,  can  escape 
when  transitioned  to  a  higher  temperature  as  part  of  the  exchange  process.  Thus,  accuracy 
of  conformation  is  maintained  through  analysis  at  low  temperatures  while  simulations  at 
higher  temperatures  efficiently  achieve  exploration  of  the  potential  energy  surface. 

The  sequences  used  for  the  simulations  were  chosen  from  the  set  of  previously 
synthesized  active  analogs  (15).  REMD  simulations  were  run  on  the  cyclic  analogs 
cyclic-[EKTPVNPGN],  cyclic-[EKTPVNPGQ],  cyclic-[EMTPVNPGQ],  and  the  linear 
analogs  EMTPVNPG  and  EMTPTNPG.  In  addition,  REMD  simulations  were  run  on  the 
smaller  analogs  EMTPVNP,  MTPVNPG,  TPVNP,  TPVN,  and  PVNP.  Eight  different 
replicas  were  used,  each  defined  initially  with  the  same  input  structure.  Temperatures 
were  selected  to  agree  with  exponential  growth  such  that  interchange  occurred  within  the 
temperature  group  tempi,  temp0=  265K,  304K,  350K,  402K,  462K,  53 IK,  61  OK  and 
700K.  The  number  of  exchange  attempts  between  neighboring  replicas  was  set  to  1000 
and  the  number  of  MD  steps  between  exchange  attempts  was  defined  as  10000.  Thus,  the 
total  length  for  each  simulation  with  a  time  step  of  0.002  picoseconds  was  20 


nanoseconds  (19). 


The  linear  peptide  analogs  shown  in  Table  2  were  prepared  using  Fmoc  solid- 
phase  synthesis  (14,  15,  29).  The  anti-estrogenic  activity  of  each  peptide  was  determined 
using  the  immature  mouse  uterine  growth  assay  as  described  previously  (2,  7). 
Intraperitoneal  (i.p.)  administration  of  0.5  pg  of  17|3-estradiol  (E2)  to  an  immature  female 
mouse  doubles  its  uterine  weight  in  24  hours  (7).  Swiss/Webster  female  mice,  weighing 
6-8  grams  at  13-15  days  old,  were  obtained  from  Taconic  Farms.  Mice  were  grouped  so 
that  each  group  had  the  same  range  of  body  weights.  Each  group  received  two  sequential 
i.p.  injections  spaced  one  hour  apart.  The  peptide  or  a  saline  solution  control  was 
contained  in  the  first  injectant,  and  E2  or  a  saline  control  was  contained  in  the  second 
injectant.  Twenty-two  hours  after  the  second  injection,  uteri  were  dissected  and  weighed 
immediately.  The  uterine  weights  were  normalized  to  mouse  body  weights  (mg  uterine/g 
body).  Experiments  used  a  minimum  of  five  mice  per  group,  and  the  mean  normalized 
uterine  weight  and  standard  deviations  were  determined  for  each  group.  Percent  growth 
inhibition  in  a  test  group  was  calculated  from  the  normalized  uterine  weights  as  given  by 
equation  1. 

Growth  inhibition  (%)  =  (Full  E2  stimulation  -  E2  stimulation  in  test  group)/(Full  E2 
stimulation  -  No  E2  stimulation)  x  100%  1 

Significance  of  differences  between  groups  was  evaluated  with  the  nonparametric 
Wilcoxon  ranks  sum  test  (one-sided).  Generally,  drug-induced  growth  inhibitions  of  20% 
or  greater  are  significantly  different  (p  <  0.05)  from  the  group  receiving  E2  alone.  Each 
AFPep  analog  was  evaluated  for  antiuterotrophic  activity  in  three  or  more  experiments, 
and  the  mean  growth  inhibition  ±  the  standard  error  for  each  analog  is  reported  in  Table 


2. 
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Abstract: 

A  combined  study  utilizing  ab  initio  theory,  polarizable  continuum  solvation  methods 
and  thermodynamic  cycles  illustrates  the  local  structure  of  the  autoionization  products  of 
water.  The  present  results  are  consistent  with  the  most  recent  interpretation  of  X-ray 
absorption  experiments  of  liquid  water.  The  possible  ionization  products  are  limited  to 
H30+,  H502+,  H9O/,  H7O4',  and  OH'  for  the  clusters  investigated  (H30+(H20)n  and  OH' 
(H20)„,  (n=0-3)).  While  other  ion  structures  may  be  transient  structures,  they  are  not 
predicted  to  be  significant  in  liquid  water.  In  addition,  the  formation  of  different  ion 
products  is  shown  to  be  dependent  on  the  local  water  structure.  These  results  are  also 
consistent  with  the  latest  interpretations  of  water  structure. 

Introduction 

Autoionization  of  water,  or  the  dissociation  of  water  into  its  ionic  components,  is 
one  of  the  most  fundamental  reactions  in  chemistry.  Most  textbooks  describe  the 
dissociation  of  water  one  of  two  ways: 

H20(|)  -*  H+(aq)  +  OH  (aq)  ( 1 ) 

2H20(|)  -*■  H30+(aq)  +  OH  (aq)  (2) 

but  the  nature  of  the  ion  products  is  still  a  subject  of  active  debate.1'31 

The  first  Car-Parrinello  molecular  dynamics  (CPMD)  simulations32  on  aqueous 
hydroxide  solutions  identified  I I9O5  as  the  dominant  structure  of  OH  (aq). 1  The  H9O5' 


[H0'(H20)4]  structure  has  four  water  molecules  hydrogen  bonded  with  OH'  in  an 
approximately  planar  conformation.1  Additional  CPMD  simulations  confirmed  the 
presence  of  H905'  and  additionally  showed  that  at  low  pH,  H3O2'  is  not  present,  while  a 
variety  of  clusters  can  form  as  the  pH  increases.9, 1 1  Spectroscopic  experiments  on  highly 
basic  solutions  led  to  the  prediction  that  H3O2'  is  the  dominant  solution  structure  of  the 
anion.33  The  anion  has  also  been  described  as  a  species  that  has  three  water  molecules 
hydrogen  bonded  to  it,  H7O4',  initially  to  reconcile  experimental  and  computational 
studies  of  ester  hydrolysis.34  Subsequently,  statistical  mechanical  quasi-chemical  theory 
and  adiabatic  ab  initio  molecular  dynamics  simulations,  contrary  to  CPMD  results,  were 
shown  to  support  the  prediction  that  H7O4'  is  the  predominant  structure  in  solution.17' 33  A 
combination  of  neutron  diffraction  with  hydrogen  isotope  substitution  and  Monte  Carlo 
simulations16  lend  support  to  the  H9O5'  species  predicted  by  CPMD  simulations.1  The 
interpretation  of  neutron  diffraction  data  led  to  the  prediction  that  the  number  of  waters 
strongly  bound  to  the  oxygen  of  the  hydroxyl  ion  is  reduced  from  four  to  three  as  the 
concentration  of  hydroxyl  ion  is  increased.20,22  A  recent  review  on  structure  and 
dynamics  of  OH'(aq)  argues  for  a  nonclassical,  hypercoordinated  solvation  structure,  in 
which  H9O5'  is  the  predominate  ion,  but  H7O4'  is  also  present.26  Despite  the  confusion  in 
the  liquid  phase,  it  is  clear  that  in  the  gas  phase  three  waters  surround  the  hydroxide 
anion  to  form  the  first  solvation  shell.14, 36  Ab  initio  calculations  on  gas  phase  OHXH2O),, 
structures  show  that  the  H9O5'  structure  is  less  stable  than  the  H7O4'  structure  with  one 
water  added  to  the  second  shell.37,  38  The  0H'(H20)  gas  phase  anion  is  strongly  bound, 
leading  to  the  disappearance  of  the  HOH  intramolecular  bending  transition  and  a 
dramatic  red-shift  of  the  shared  proton  stretch.39 


While  most  textbooks  represent  H+(aq)  as  the  hydronium  ion,  H30+,  other  studies 
suggest  H502+  or  H9O/  for  the  proper  representation  of  the  cation.40  42  The  dynamics  of 
excess  proton  motion  and  proton  transfer  in  water  have  been  studied  by  both  quantum 
and  classical  simulations.1, 4‘6, 24-27’ 43-48  The  Zundel  cation,  H5O2,  with  its  strongly  bound 
proton  equally  shared  between  two  waters,  has  been  extensively  studied.15' 18,49-52  The 
effects  of  increasing  basis  set  size  and  using  high  levels  of  electron  correlation  for 
protonated  systems  such  as  H5C>2+  are  well  documented.49  A  combined 
spectroscopic/computational  study  predicts  both  the  Zundel  cation,  H5O2,  and  the  Eigen 
cation,  H9O/,  to  be  present  in  bulk  water. 1 3  Protonated  clusters  of  water  molecules  have 
been  studied  in  the  mid-IR  range  and  the  H30+  and  Hs02+  ions  have  been  observed  for  up 
to  27  water  molecules.52, 53  Studies  on  varying  concentrations  of  HC1  solutions  using  X- 
ray  absorption  combined  with  density  functional  theory  have  led  to  the  conclusion  that  in 
highly  acidic  solutions  the  Eigen  form  dominates,  while  the  proton  is  not  as  localized  to  a 
specific  water  under  less  acidic  conditions.25, 29  The  use  of  multi-state  empirical  valence 
bond  (MS-EVB)  approaches  predict  that  the  solvation  structure  of  the  hydrated  proton  in 
water  is  approximately  a  65:35  mixture  of  the  Eigen  and  Zundel  cations.27  Ultrafast 
vibrational  and  structural  dynamics  of  the  proton  in  liquid  water  reveal  that  the  lifetime  of 
the  0-H+  stretching  mode  of  the  Eigen  cation  and  interconversion  between  Eigen  and 
Zundel  hydration  structures  of  the  proton  are  on  the  order  of  100  femtoseconds.30  A 
neutron  diffraction  experiment  with  isotopic  substitution  on  a  concentrated  HC1  solution, 
combined  with  a  Monte  Carlo  simulation,  supports  the  existence  of  either  hydronium 
ions,  Eigen,  or  Zundel  complexes.19,21  The  accuracy  of  the  fit  between  the  data  from 
Monte  Carlo  simulations  and  the  diffraction  data  when  the  starting  hypotheses  for  the 


simulations  are  bare  H+  ions,  all  H30+  ions,  or  all  Hs02+  ions  in  the  simulation  box, 
together  with  water  and  chlorine  ions,  gives  the  same  general  picture.  Agreement  with 
neutron  data  was  achieved  when  every  H+  ion  in  the  simulation  has  at  least  one  water 
molecule  at  a  distance  of  1 .05  A,  which  makes  a  hydronium  ion.  Three  quarters  of  the 
hydronium  ions  have  a  second  water  at  1.32  A,  which  is  described  as  either  a  distorted 
Zundel  or  distorted  Eigen  cation  depending  on  the  other  nearest  water  neighbors  to  these 
complexes.21 

The  nature  of  local  water  structure  in  the  liquid  is  also  controversial,  and  much 
experimental54'65  and  theoretical7, 59, 66-84  research  has  been  directed  at  this  fundamental 
problem.85  92  Benson  and  Siebert  proposed  a  two-structure  model  for  liquid  water  based 
on  elegant  thermodynamic  arguments.66  In  their  model  water  is  a  cubic  octamer,  which 
can  dissociate  into  two  planar  tetramers,  and  they  showed  that  isomerization  between 
clusters  reproduced  the  heat  capacity  of  liquid  water  from  0  to  100°  C.66  Classical 
simulations  of  water  dynamics  predict  results  consistent  with  the  force  fields  in  use,  with 
most  predicting  tetrahedral  networks.67, 75,91  In  general,  water  models  are  developed  to 
reproduce  bulk  liquid  properties,  such  as  the  diffusion  coefficient,  and  the  heat  of 
vaporization.  In  contrast  to  classical  treatments,  quantum  mechanical  approaches  favor  a 
less  symmetrical  tetrahedral  configuration  of  water  building  blocks.73 

Recently,  X-ray  absorption  (XAS)  and  X-ray  Raman  scattering  (XRS) 
experiments  have  been  used  to  probe  the  molecular  arrangement  in  the  first  coordination 
shell  of  liquid  water.60,62  In  contrast  to  the  commonly  reported  tetrahedral  structure  by 
both  theory  and  experiment,  the  data  was  interpreted  to  be  consistent  with  a  local  water 
structure  that  does  not  adopt  a  primarily  tetrahedral  water  structure.60, 62  Rather,  each 


water  has  two  hydrogen  bonded  configurations,  as  a  strong  donor  and  as  a  strong 
acceptor.  This  model  of  water  is  of  ring  structures  or  linear  chains  embedded  in  a 
disordered  water  cluster  network.  These  recent  results  have  called  into  question  the 
results  from  ab  initio  and  classical  molecular  dynamics  simulations.60  Infrared 
spectroscopy  measurements  compared  against  the  percolation  model  lead  to  the 
prediction  that  at  298  K  the  average  coordination  number  is  2.4,  with  the  dominating 
population  consisting  of  molecules  with  three  hydrogen  bonds.61 

Several  subsequent  studies  have  questioned  the  results  from  this  study,  and 
produced  evidence  further  reaffirming  the  existence  of  tetrahedral-type  networks.  Raman 
spectra  collected  from  278  -  353  K  combined  with  Monte  Carlo  simulations 
implementing  a  commonly  used  classical  water  model  support  a  tetrahedral  description  of 
liquid  water.90  Recently,  the  Raman  depolarization  ratio  of  water  as  a  function  of 
temperature  has  been  interpreted  as  a  transition  from  tetrahedrally  coordinated  water  to 
locally  planar  water  structures  above  the  phase  transition  temperature  of  35°  C.65  In 
addition,  two  separate  reinterpretations  of  the  XAS/XRS  data  were  shown  to  support 
tetrahedral  coordination.92' 93  Reevaluation  of  the  H-bond  criteria  utilized  to  evaluate  the 
XAS  data  brings  the  XAS  results  into  agreement  with  other  experimental  data  (i.e.  TEY- 
NEXAFS).93  Combination  of  the  results  from  X-ray  scattering  data,  which  can  access 
information  outside  the  range  of  XAS,  confirmed  the  presence  of  the  asymmetric 
character  observed  in  the  XAS  experiment,  but  not  as  the  dominate  bulk  water  structure. 
The  local  configuration  of  liquid  water  was  still  shown  to  be  primarily  that  of  a 
tetrahedrally  coordinated  random  network.92  In  addition  a  combination  of  femtosecond 
2D  IR  spectroscopy  and  molecular  dynamics  simulations  reveal  that  hydrogen  bonds  are 


only  broken  fleetingly  in  liquid  water,  and  that  nonhydrogen-bonding  configurations 
return  to  a  hydrogen-bonding  partner  within  200  fs.S9 

Recent  work  using  first  principles  molecular  dynamics  simulations  indicates  that 
hybrid  density  functionals  may  accurately  model  water  once  longer  simulation  times  can 
be  reached.80  Preliminary  results  suggest  that  each  water  has  an  average  of  3.48,  3.58, 
and  3.67  hydrogen  bonds  for  the  PBEO,  X3LYP,  and  B3LYP  functionals.80  The  B3LYP 
functional  performs  best  when  compared  with  benchmark  calculations  for  autoionization 
of  a  water  octamer,  and  has  been  suggested  as  the  preferable  functional  for  Car-Parinello 
molecular  dynamics  (CPMD)  and  quantum  mechanics/molecular  mechanics  simulations 
of  autoionization  of  water.94  Computation  of  the  X-ray  absorption  spectrum  that  would 
result  from  molecular  dynamics  simulations  leads  to  the  prediction  that  the  BLYP  CPMD 
simulations  do  not  represent  the  actual  water  structure.82  A  careful  comparison  of  CPMD 
and  Bom-Oppenheimer  molecular  dynamics  simulations  of  liquid  water  at  a  temperature 
of  423K  and  a  density  of  0.71  g  cm'3  yield  consistent  results  for  the  liquid  structure  and 
the  self-diffusion  coefficient.95  The  statistical  uncertainties  in  the  self-diffusion 
coefficient  are  quite  large,  and  the  authors  point  out  that  a  CPMD  simulation  of  500  ps 
would  be  required  to  obtain  a  self-diffusion  coefficient  with  a  statistical  uncertainty  of 
50%.95  Correlation  of  experimental  data  with  computed  properties  for  liquid  water  is  not 
straight  forward  and  subject  to  a  variety  of  assumptions  about  the  local  structure  of  bulk 
water  and  its  energetic  properties.75, 83,87  Additionally,  the  unique  aqueous  phase 
properties  emanating  from  the  small  molecular  unit  of  water  prevail  and  continue  to 
challenge  both  theoreticians  and  experimentalists  alike,  such  as  the  full  characterization 
of  the  autoionization  of  water.59 


A  few  essential  questions  remain  unanswered:  for  example,  what  exactly  are  the 
ionic  states  of  the  autoionization  products  of  water,  H+,  H30+,  HsO/,  H703+  or  H9O/  and 
what  are  the  corresponding  anions  associated  with  the  cations?  High  level  ab  initio 
theory,  polarizable  continuum  solvation  methods  and  thermodyamic  cycles  of  the 
dissociation  of  neutral  water  at  298.15  K  were  implemented  to  address  these  questions, 
and  are  reported  in  this  paper. 

Methods 

The  following  thermodynamic  cycle  was  utilized  to  evaluate  the  different 
possibilities  for  the  dissociation  of  water  into  its  corresponding  cations  and  anions: 


AGgas 


AlH2Ogas 

-*• 

(H20)w.jc0H  gas 

+ 

(H20)xH 

1 AGS0! 

1  AGsol 

1  AGsoi 

«H2Oaq 

(H20)™0Haq 

+ 

(H2o)xH 

AGaq 


In  which  AGaqcan  be  calculated  from  the  thermodynamic  cycle  as  follows: 

AGaq  =  AGgas  +  AGso,[(H20)n.xOH  ]  +  AGS0|[(H20),H+]  -  AGso)(«H20)  (4) 

or 

AGaq  =  AGgas  +  AAGS0|  (5) 

where 

AAGsoi  =  AGSOi[(H20)„.,OH  ]  +  AGSOi[(H20),H+]  -  AGso,(hH20)  (6) 

Since  the  experimental  value  for  AGaq  is  known  for  each  possible  cycle,  the 
corresponding  ion  clusters  can  be  deduced  based  on  the  fit  to  the  experimental  data.  Gas 
phase  geometry  optimized  structures  were  located  at  the  HF/6-31 1++G(2d,p)  and  MP2/6- 
3 1 1 ++G(2df,2pd)  levels  as  well  as  with  the  Model  Chemistry  Methods  G2%  and  CBS- 


APNO97  using  Gaussian  98. 98  All  structures  were  verified  as  minima  on  the  potential 
energy  hypersurface  through  frequency  calculations.  Energies  were  computed  with 
coupled  cluster  theory99’ 100  with  single,  double  and  triple  substitutions.101  105  Triple 
excitations  were  included  for  both  the  complete  MP4  and  CCSD(T),  and  all  inner  shell 
electrons  were  included  in  the  electronic  correlation  calculations.  Free  energies  at 
298.15  K  were  obtained  by  correcting  the  gas  phase  energies  with  the  results  from  the 
zero-point  and  thermal  energies  at  the  HF/6-3 1 1++G(2d,p)  or  MP2/6-3 1 1++G(2df,2pd) 
levels  of  theory.  Additionally,  gas  phase  values  were  corrected  from  a  standard  state  of 
1  atm  to  a  standard  state  of  1  M  in  order  to  be  used  in  the  thermodynamic  cycle.106 1 1 1 
The  effect  of  using  water  clusters  as  reactants  was  gauged  by  including  a  water  dimer, 
tetramer,  hexamer  and  an  octamer  in  addition  to  the  usual  monomer  representation. 

Values  for  the  free  energy  of  solvation  were  calculated  using  the  polarizable  conductor 
model  (CPCM),112  which  is  based  on  the  Polarized  Continuum  model  (PCM).113'119  All  of 
the  solvation  computations  were  performed  on  gas  phase  geometries  optimized  at  the 
HF/6-3 1 1++G(2d,p)  level,  using  the  6-31G(d)  and  6-31+G(d)  basis  sets.  Additionally,  the 
surface  of  each  sphere  was  subdivided  into  300  triangular  tesserae,  and  the  area  of  the 
tesserae  was  set  to  0.3  A2. 

Results 

Detailed  comparison  of  the  G2,  G3,  CBS-QB3,  and  CBS-APNO  model  chemistry 
methods  against  experimental  and  high  level  ab  initio  predictions  have  verified  the 
appropriateness  of  these  methods  for  water  cluster120'122  and  H+(H20)„  and  OHXH2O),, 
ionic  cluster  structure  calculations.123  Additionally,  the  ability  of  the  model  chemistry 
methods  to  accurately  model  the  thermochemistry  for  cluster  formation  of  neutral  and 


ionic  clusters  has  been  documented.123 125  The  AGsoi  for  H30+(H20)„  and  OH'(H20)„ 
clusters,  which  include  a  maximum  of  three  waters,  and  (H2O),,  (n=  1,2, 4,6,  and  8)  has 
been  determined  utilizing  CPCM/HF/6-3  lG(d)//HF/6-3 1 1++G(2d,p)  and  CPCM/HF/6- 
31+G(d)//HF/6-31 1++G(2d,p),  Table  1.  Utilizing  the  values  for  AGsoi  shown  in  Table  1, 
the  AAGsoi  values  were  determined  for  the  dissociation  of  water  into  the  aforementioned 
ion  products,  Table  2.  Again,  the  values  for  AGsoi  shown  in  Table  1  were  utilized  to 
determine  AGaq  for  a  series  of  reactions,  in  which  various  water  clusters  and  monomers 
dissociated  to  form  various  forms  of  the  hydronium  and  hydroxide  ions,  bottom  of  Table 
3.  The  corresponding  gas  phase  free  energies,  AGgas,  (using  G2,  CCSD(T),  and  CBS- 
APNO)  have  also  been  determined. 


Table  1.  AGSOi  for  each  species  investigated  using  structures  optimized  at  the  HF/6-31 1++G(2d,p)  level  of 


Specie 

CPCM/HF/ 

6-3  IG(d) 

CPCM/HF/ 

6-31-Ki(d) 

Exp.* 

HiO 

-105.58 

-105.89 

-105.00 

OH 

-112.11 

-111.45 

-110.00 

H:C) 

-6.35 

-7.23 

-6.31 

(H.O). 

-9.56 

-10.61 

h3o2’ 

-76.87 

-77.45 

HA 

-78.21 

-80.03 

(H,o)4 

-7.92 

-9.10 

h7o/ 

-68.68 

-69.68 

HjO, 

-74.82 

-77.27 

(HA, 

-6.08 

-7.14 

HA' 

-56.68 

-57.88 

HA 

-59.39 

-61.56 

(HA, 

-4.63 

-5.66 

"References 

no  .ns;  "it? - 

Table  2.  AAGSOi  calculated  for  each  representation  of  the  ionization 

CPCM/HF/  CPCM/I 

Reaction 

6-31G(d) 

6-3  l-»-G( 

2H20 

— 

H30+  +  OH' 

-204.99 

-202.88 

(H20)2 

-* 

H30+  +  OH 

-208.13 

-206.73 

4H20 

— 

h5o2+  +  h3o2* 

-129.68 

-128.56 

(h20)4 

— 

h5o2+  +  h3o2- 

-147.16 

-148.38 

6H20 

h7o3+  +  H503 

-105.40 

-103.57 

(H20)6 

— 

h7o3+  4-  h5o3 

-137.42 

-139.81 

8H20 

— ► 

H904+  4-  h7o4 

-65.27 

-61.60 

(H20)8 

— 

H904+  4-  h7o4 

-111.44 

-113.78 

3H20 

— 

h3o+  +  h3o2 

-164.74 

-164.23 

3H20 

— 

H502+  +  OH' 

-169.93 

-167.21 

5H20 

— 

H502+  4-  H30+  4-  20H 

-374.92 

-370.09 

5H20 

— 

2H30+  +  0H  +H302 

-369.73 

-367.11 

6H20 

h3o+  +  h5o2+  +  oh 

+  H302-  -334.67 

-331.44 

Exp. 


-202.38 


Table  3.  Free  energies  of  ionization  in  the  gas  phase  (top)  and  aqueous  phase  (bottom). 


AGgas 

Reaction 

CCSD(T) 

G2 

CBS-APNO 

Exp. 

2H20 

— 

H30+  +  0H* 

227.08 

225.42 

225.27 

226.23 

(H20)2 

— 

H30+  +  OH* 

227.16 

225.10 

4H20 

— 

H502+  +  H302* 

177.48 

177.56 

177.31 

(H20)4 

— 

h5o2+  4-  h3o2 

181.56 

6H20 

— 

h7o3+  +  h5o3 

145.50 

150.37 

149.58 

(H20)6 

— 

h7o3+  +  h5o3* 

151.24 

8H20 

— 

H904+  +  H704 

120.49 

126.16 

127.60 

(H20)8 

— 

h9o/  4  h7o4* 

131.15 

3H20 

— 

h3o+  4  h3o2* 

205.22 

205.19 

205.04 

3H20 

— 

H502+  4  OH* 

199.35 

197.80 

197.46 

5H20 

— 

H502+  +  H30+  4  20H 

426.43 

423.22 

422.83 

5H20 

— 

2H30+  +  OH*  4  H302* 

432.31 

430.61 

430.31 

6H20 

— 

H30+  4  H502+  4  OH*  4  H302* 

404.56 

402.98 

402.58 

AGaq 

Reaction 

CCSD(T) 

a  b 

G2 

a  b 

CBS-APNO 

a  b 

Exp. 

2H20 

— 

H30+  4  OH* 

22.09 

24.20 

20.43 

22.51 

20.28 

22.39 

23.85 

(h20)2 

— 

H30+  4  OH 

19.03 

20.43 

16.97 

18.37 

21.08 

4H20 

— 

H502+4H302* 

47.80 

48.92 

47.88 

49.00 

47.63 

48.75 

28.60 

(h20)4 

— 

H502+4H302 

34.40 

33.18 

20.70 

6H20 

— 

H703+4Hs03 

40.10 

41.93 

44.97 

46.80 

44.18 

46.01 

33.36 

(H2o)6 

— 

H703+4H503* 

13.82 

11.43 

20.43 

8H20 

— 

h9o4+  4  h7o4* 

55.22 

58.89 

60.89 

64.56 

62.33 

66.00 

38.12 

(h20)8 

-* 

h9o4+  4  h7o4 

19.71 

17.37 

20.26 

3H20 

— 

h3o+  4  h3o2 

40.48 

40.99 

40.45 

40.96 

40.30 

40.81 

26.24 

3H20 

— 

H502+  4  OH* 

29.42 

32.14 

27.87 

30.59 

27.53 

30.25 

26.24 

5H20 

— 

H502+  4  H30+  4  20H* 

51.51 

56.34 

48.30 

53.13 

47.91 

52.74 

31.02 

5H20 

— 

2H30+  4  OH*  4  H302* 

62.58 

65.20 

60.88 

63.50 

60.58 

63.20 

31.02 

6H20 

— 

H30+  4  H502+  4  OH*  4  H302 

69.89 

73.12 

68.31 

71.54 

62.91 

71.14 

33.36 

All  values  are  in  kcal/mol.  See  text  for  description  of  determination  of  experimental  values. 
aCPCM/6-3 1  G(d),  bCPCM/6-3 1  G+G(d) 


Discussion 

The  experimental  values  for  the  change  in  free  energy  upon  solvation  of  H30+, 
OH',  and  H2O  are  known,110’ 126, 127  and  these  values  are  compared  with  the  CPCM 
calculated  values  in  Table  1.  Both  basis  sets  are  within  reasonable  agreement  with  the 
experimental  value  for  H30+,  OH',  and  H2O.  The  best  agreement  with  the  experimental 
AAGSoi  value  is  achieved  from  the  dissociation  of  2  water  monomers  to  form  a  hydronium 
and  hydroxide  ion.  Use  of  the  6-31+G(d)  basis  set  overestimates  the  experimental  value 


by  almost  2  kcal/mol  for  both  the  hydroxide  ion  and  the  water  monomer.  Therefore, 
cancellation  of  this  error  allows  the  CPCM/6-31+G(d)  method  to  produce  the  most 
accurate  value  for  AAGSOi  for  the  first  reaction  in  Table  2.  Whether  this  cancellation  of 
errors  is  a  common  theme  among  the  remaining  reactions  listed  in  Table  2  has  not  been 
verified,  but  since  the  CPCM  method  was  parameterized  using  HF/6-31G(d)  geometries 
and  the  value  for  AGSOi  of  water  was  included  in  the  parameterization  set,  the  error  for 
AGS0|  of  water  is  expected  to  be  lower  for  CPCM/6-31G(d).  The  analogous  ionization 
from  the  water  dimer  is  also  in  agreement  with  the  experimental  value  of  AAGS0|.  Table  3 
contains  the  values  for  electronic  energies  obtained  for  H30+,  OH',  two  water  monomers, 
and  for  the  water  dimer  with  computations  at  the  CCSD(T)  level  on  the  HF/6- 
31 1++G(2d,p)  structures,  and  using  the  G2  and  CBS-APNO  model  chemistries.  This 
table  also  includes  the  results  calculated  for  AGaq  obtained  from  combining  the  AGgas 
quantities  with  the  AAGSOi  values,  shown  in  Table  2.  The  experimental  value  for  Kc,  and 
thus  AGaq,  depends  on  the  balanced  chemical  equation  2H2O  <»  H30+  +  OH',  yielding  K* 
=  [H30+][0H']/[H20]2  =  Kw/[H20]2.  From  the  value  of  Kw  (1.01 16  x  10'14  at  298  K)128 
and  the  concentration  of  water  (55.34  M  at  298  K)  Kc  =  3.3025  x  10'18  and  AGaq  = 

23.85  kcal/mol.  The  corresponding  values  of  AGaq  calculated  for  the  reaction  2H20  <=> 
H30+  +  OH'  are  22.09,  20.43,  and  20.28  kcal/mol  for  the  CCSD(T),  G2,  and  CBS-APNO 
methods,  respectively,  combined  with  the  CPCM/HF/6-31G(d)  solvation  method.  All  of 
these  numbers  compare  quite  well  to  the  experimental  value  of  23.85  kcal/mol. 

Assuming  that  the  local  water  structure  can  be  represented  as  a  more  structured 
entity,  clusters  can  be  used  in  the  place  of  monomers  as  the  bulk  water  model,  with  the 
smallest  possible  model  being  the  water  dimer.  For  the  reaction  of  the  dimer,  (H20)2  <=> 


H30+  +  0H\  Kc  =  [H30+][0H']/[(H20)2]  =  Kw/[(H20)2]  .  If  bulk  water  is  actually  made 
up  of  water  dimers,  then  the  concentration  of  water  dimers  would  be  27.67  M,  so  that  Kc 
=  3.65595  x  1  O'16,  and  AGaq  =  21.08  kcal/mol.  Computation  of  AGaq  for  the  dissociation 
of  the  water  dimer  into  H30+  and  OH'  in  aqueous  solution  yields  values  for  the  CCSD(T) 
and  CBS-APNO  methods  combined  with  the  CPCM/6-3  lG(d)  solvation  method  (and 
CPCM/6-3 1+G(d))  that  deviate  by  2.05  and  4. 1 1  kcal/mol  (0.65  and  2.71  kcal/mol)  from 
the  experimental  value  of  21.08  kcal/mol.  Although  these  values  are  in  worse  agreement 
with  the  experimental  AGaq  value  than  for  the  analogous  dissociation  of  the  water 
monomer,  the  error  for  these  methods  is  still  within  2.1  kcal/mol  for  the  CCSD(T) 
calculations  combined  with  either  continuum  solvation  method.  Alternative  dissociations 
of  water  were  explored  to  gain  insight  into  both  water  structure  and  the  structure  of  the 
ionic  dissociation  products.  Utilizing  various  water  clusters  consisting  of  up  to  8  water 
molecules,  the  only  system  besides  the  dimer  that  is  in  agreement  with  experiment  is  the 
dissociation  of  the  water  octamer  into  H9O/  and  H7O4'  (Table  3). 

Further  investigation  of  the  dissociation  of  water  monomers,  with  a  maximum  of 
8  constituents,  into  various  ion  products  reveals  that  one  additional  reaction  is  in 
agreement  with  experiment  (taking  into  account  the  error  of  the  computation):  3H20  <=> 
Hs02+  +  OH'.  Therefore,  the  H502+  ion  is  also  predicted  to  be  formed,  which  agrees  with 
experiment. I3,21, 52, 53  The  energy  associated  with  the  formation  of  the  Zundel  cation  was 
only  found  to  be  in  agreement  with  experiment  when  the  local  bulk  water  description  was 
that  of  solvated  monomers.  Since  a  less  structured  model  for  liquid  water  is  supported  by 
recent  XAS/XRS  results,  the  description  of  the  local  water  structure  as  solvated 
monomers  has  experimental  and  theoretical  support. 


The  water  octamer  is  an  interesting  model,  since  it  does  represent  a  primarily 
tetrahedral-like  structure.  In  a  D2d  or  S4  cubic  octamer,  each  water  in  the  cube  forms  three 
hydrogen  bonds  with  neighboring  waters,  and  each  water  can  serve  either  as  a  hydrogen 
bond  acceptor  or  donor  to  one  additional  neighboring  water  by  virtue  of  the  tetrahedral 
shape  displayed  by  the  cube.121  Thus  the  formation  of  the  H9O/  and  H7O4’  ions  from  the 
water  octamer  fits  with  the  notion  that  water  structure  is  primarily  tetrahedral. 

Given  the  errors  in  the  methods,  which  lead  to  an  uncertainty  in  the  final  free 
energies  of  several  kcal/mol,  these  results  lead  us  to  predict  that  all  five  species,  l-hO4, 
H502+,  H9C>4+,  H7O4',  and  OH",  form  from  water  dissociation.  While  other  ionic  species 
may  form  transiently  in  liquid  water,  the  thermodynamic  analysis  presented  here  supports 
the  view  that  autoionization  of  water  is  limited  to  the  five  ion  products  mentioned  above. 
One  caveat  is  that  we  have  not  looked  at  ion  products  containing  more  than  three  water 
molecules  (i.e.  H30+(H20)n  and  0H"(H20)„  where  n  >  3).  The  possibility  that  HjO+  and 
OH"  are  autoionization  products,  and  do  not  always  exist  as  tightly  bound  clusters,  is 
consistent  with  more  recent  experimental  studies  emphasizing  the  dynamic  nature  of 
liquid  water.31, 60,62 

Previous  thermodynamic  work  on  the  structure  of  water  led  to  a  tetramer/octamer 
two-state  model.66  The  cyclic  tetramer,  cyclic  pentamer,  and  cubic  octamers  are  very 
stable  structures  in  the  gas  phase.  ’  ’  ’  The  quantum  chemical  models  employed 

in  the  present  study  lead  to  the  prediction  that  the  primary  ions  formed  in  autoionization 
are  H30+,  Hs02+,  H9C>4+,  H7O4",  and  OH".  The  formation  of  larger  ions  (H904+  and  H7O4  ) 
directly  from  water  autoionization  depends  upon  the  presence  of  a  more  structured  local 
configuration  of  liquid  water.  Of  the  ionization  reactions  investigated  in  this  paper,  only 


the  presence  of  a  water  octamer  would  result  in  the  formation  of  the  larger  ions.  The 
thermodynamic  analysis  presented  here  shows  that  formation  of  these  two  larger  ions 
from  eight  solvated  water  monomers  is  not  in  agreement  with  experiment. 

The  use  of  two  different  models  to  represent  the  local  structure  of  bulk  water 
yields  two  different  groups  of  ionization  products.  Within  the  representation  that  the  local 
water  structure  can  be  represented  as  solvated  monomers  or  dimers,  HaO+,  HsCS*,  and 
OH'  are  the  ionization  products.  In  contrast,  a  loose  configuration  of  water  molecules 
forming  a  cubic  octamer  structure  is  in  alignment  with  the  idea  of  a  tetrahedral-like  local 
water  structure,  and  this  local  structure  would  make  the  dissociation  of  water  into  the 
H9O/  and  H7O4'  ion  products  plausible.  Based  on  the  current  interpretation  of 
experimental  data  it  is  probable  that  a  variety  of  local  structures  exist,  with  a  tetrahedral- 
like  structure  dominating  bulk  water,  yielding  support  for  both  local  water  descriptions 
described  here.  The  view  that  bulk  water  solvates  individual  monomers  (and  possibly 
dimers),  and  is  also  a  tetrahedral-like  structure,  directly  limits  the  structures  of  the 
resulting  ion  products.  The  success  of  continuum  solvation  models  for  the  representation 
of  bulk  water  surely  implies  that  the  view  of  local  structure  as  bulk  water  (simulated  by  a 
dielectric  continuum)  solvating  individual  monomers  has  a  strong  theoretical  basis.131  In 
terms  of  the  tetrahedral-like  local  water  structure,  we  note  that  there  is  a  direct  correlation 
between  the  increase  in  the  strength  of  the  local  hydrogen  bonding  interactions,  i.e.  8H2O 
-*  (H2CO8,  and  the  formation  of  larger  ion  clusters,  i.e.  HgCV  and  H7O4',  which  also  have 
strong  local  hydrogen  bonding  interactions.  In  summary,  the  thermodynamics  that  control 
the  formation  of  specific  ion  products  is  dependent  on  the  local  water  structure. 


The  ability  of  this  method  to  predict  the  formation  of  the  three  commonly  thought 
to  exist  ions,  Hs02+,  H9O/,  and  H7CV,  not  only  leads  to  additional  confirmation  of  the 
presence  of  these  ions,  but  also  to  a  more  detailed  description  of  the  localized  water 
structure  before  and  after  ion  formation.  The  structure  of  water  predicted  from  these 
results  is  limited  to  monomers,  octamers,  and  possibly  dimers.  The  octamer  model 
accounts  for  the  mostly  tetrahedral  arrangement  of  water  molecules  and  accounts  for  the 
formation  of  the  HgO^  and  H7CV  ion  products. 

The  traditional  Eigen  structure,  H9O4  ,  is  often  described  as  the  hydronium  ion, 
H30+,  solvated  by  three  other  water  molecules.  Many  researchers  in  the  field  actually 
refer  to  H30+  as  the  Eigen  structure,52’ 53  which  can  be  a  source  of  confusion.  Our  results 
suggest  that  the  hydronium  ion  is  constantly  forming,  breaking,  and  reforming  hydrogen 
bonds  with  water  molecules.  Femtosecond  vibrational  pump-probe  spectroscopy  was 
utilized  to  monitor  the  0-H+  stretching  mode,  revealing  that  the  protonic  stretching  mode 
has  a  lifetime  of  only  120  fs.30  These  results  also  support  a  less  than  100  fs  transition 
from  one  hydration  structure  to  another,  which  is  predicted  to  be  from  the  Eigen  to  the 
Zundel  structure.30  From  the  model  implemented  here,  both  of  these  structures  are 
predicted.  Additionally,  the  fast  transition  also  lends  support  to  the  idea  that  other  clusters 
are  also  likely  to  be  present,  such  as  the  weakly  coordinated  hydronium  ion  predicted 
here.  This  result  is  consistent  with  conclusions  based  on  CPMD  simulations4, 7' 44  and  X- 
ray  absorption  experiments.25,29  This  result  aligns  with  the  idea  of  proton  transfer  along  a 
hydrogen  bond  as  being  essentially  barrierless,  because  of  the  zero-point  motion  of  the 

4,  30 


proton. 


With  respect  to  the  hydroxide  ion,  the  use  of  thermodynamic  cycles  predict  two 
conformations,  both  the  previously  proposed  configuration  in  which  the  hydroxide  ion  is 
strongly  hydrogen  bound  to  three  other  waters,  but,  additionally,  that  the  OH'  ion  itself 
should  also  be  observed  as  more  than  a  transient  species.17,35  Utilization  of  three  different 
density  functionals  in  ab  initio  molecular  dynamic  simulations  led  to  three  models  for  the 
local  configuration  of  the  hydroxide  ion:  primarily  H9O5';  primarily  H7O4';  and  a  mixture 
in  which  H9O5'  is  the  dominate  configuration,  but  H7O4'  also  exists.26  Only  the  last 
model,  in  which  more  than  one  configuration  of  the  hydroxide  ion  is  observed,  were  the 
relative  magnitudes  of  the  diffusion  coefficients  for  H30+,  OH',  and  H2O  reproduced. 
Solvation  of  the  hydroxide  ion  with  greater  than  3  explicit  water  molecules  is  outside  the 
scope  of  the  present  study.  The  ability  of  this  method  to  successfully  distinguish  the 
possible  ionization  products  suggests  the  usefulness  of  this  model  to  determine  the 
relevance  of  local  hydroxide  configurations  solvated  by  additional  water  molecules, 
including  clusters  in  which  the  hydroxide  ion  behaves  as  a  hydrogen  bond  donor,  as 
suggested  by  the  recent  results  from  ab  initio  molecular  dynamics.*6 
Conclusion 

State-of-the-art  quantum  chemistry  methods  and  thermodynamic  cycles  were 
implemented  to  investigate  the  autoionization  of  liquid  water  at  298  K.  Based  on  the 
experimental  dissociation  constants  for  the  different  reactions  reported  in  Table  3,  the 
stable  species  of  water  and  ion  products  were  predicted  from  quantum  computations  and 
continuum  solvation  calculations.  Dissociation  of  liquid  water  to  form  several  ion 
products,  H30+,  H502+,  HgQ^,  H7O4',  and  OH'  are  found  to  be  in  agreement  with  current 
combined  interpretation  of  the  local  water  structure,  in  which  there  is  a  tetrahedral 


network,  but  asymmetric  fluctuations  are  ubiquitous.  All  other  ionic  species  investigated 
here,  which  may  exist  transiently,  are  predicted  not  to  exist  as  discrete  structures. 
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Abstract:  Gaussian-3  and  MP2/aug-cc-pVnZ  methods  have  been  used  to 
calculate  geometries  and  thermochemistry  of  CS2(H20)„,  where  n=l-4.  An 
extensive  molecular  dynamics  search  followed  by  optimization  using  these 
two  methods  located  two  dimers,  six  trimers,  six  tetramers,  and  two 
pentamers.  The  MP2/aug-cc-pVDZ  structure  matched  best  with  the 
experimental  result  for  the  CS2(H20)  dimer,  showing  that  diffuse  functions 
are  necessary  to  model  the  interactions  found  in  this  complex.  For  larger 
CS2(H20)n  clusters,  the  MP2/aug-cc-pVDZ  minima  are  significantly  different 
from  the  MP2(full)/6-31G*  structures,  revealing  that  the  G3  model  chemistry 
is  not  suitable  for  investigation  of  sulfur  containing  van  der  Waals 
complexes.  Based  on  the  MP2/aug-cc-pVTZ  free  energies,  the  concentration 
of  saturated  water  in  the  atmosphere,  and  the  average  amount  of  CS2  in  the 
atmosphere,  the  concentrations  of  these  clusters  are  predicted  to  be  on  the 
order  of  104  CS2(H20)  clusterscm'3  and  106  CS2(H20)2  clusterscm'3  at 
298.15  K.  The  MP2/aug-cc-pVDZ  scaled  harmonic  and  anharmonic 
frequencies  of  the  most  abundant  dimer  cluster  at  298  K  are  presented,  along 
with  the  MP2/aug-cc-pVDZ  scaled  harmonic  frequencies  for  the  CS2(H20)n 


structures  predicted  to  be  present  in  a  low-temperature  molecular  beam 
experiment. 

Introduction 

A  key  atmospheric  reaction  is  the  oxidation  of  carbon  disulfide  by  the  hydroxyl 
radical  to  produce  carbonyl  sulfide,1  an  important  atmospheric  particle.2  OCS  is  the  most 
abundant  sulfur  molecule  in  the  atmosphere  and  further  oxidation  of  this  molecule  leads 
to  the  formation  of  acid  rain.3  Water  clusters  can  catalyze  reactions  with  sulfur  species,4  4 
and  various  theoretical  and  experimental  studies  have  shown  the  importance  and 
existence  of  water  clusters  in  the  atmosphere.  While  thorough  studies  of  water  clusters 
with  other  important  sulfur  species  have  been  performed,18'24  we  are  unaware  of  any 
theoretical  work  on  carbon  disulfide  and  water  clusters.  Gaussian-3  (G3)  model 
chemistry25  has  proven  very  reliable  for  describing  water  cluster  structures  and 
thermochemistry,  as  benchmarked  against  experimental  and  high  level  ab  initio 
calculations.17'26'31  In  this  paper  we  present  the  structures  and  energies  for  CS2(H20)n 
complexes,  where  n=l-4,  at  298.15  K.  We  used  both  G3  model  chemistry  and  MP2 
theory  to  investigate  structure  and  thermochemistry,  and  compare  them  to  available 
experimental  results. 

Methods 

Multiple  structures  of  CS2(H20)„,  where  n=l-4,  were  built  with  SPARTAN.33  All 
structures  were  optimized  with  the  semi-empirical  PM3  method,34  and  unique  structures 
were  further  optimized  using  HF  theory  and  the  6-3 1G*  basis  set.35  Energies,  enthalpies, 
and  free  energies  were  obtained  with  the  G3  model  chemistry.  In  the  G3  model 


chemistry,  vibrational  zero-point  energy  is  obtained  with  a  geometry  optimization  at  the 
HF/6-31G*  level,  followed  by  scaling  the  frequencies  by  0.8929.  Then  geometries  are 
refined  with  an  MP2(full)/6-31G*  optimization,  followed  by  a  series  of  calculations  used 
for  various  energetic  corrections.25 

For  the  CS2(H20)„  clusters,  gas-phase  molecular  dynamics  (MD)  simulations 
were  performed  using  the  AMBER  8  suite  of  programs36  to  generate  a  wide  range  of 
input  structures.  In  the  first  step  of  each  MD  simulation,  the  temperature  was  gradually 
increased  from  5  K  to  the  production  temperature  over  one  nanosecond.  Production 
temperatures  were  40  K  except  where  n=l,  where  the  temperature  was  35  K..  The 
production  simulation  lasted  for  nine  nanoseconds.  The  non-bonded  cutoff  was  20  A,  and 
the  GAFF  force  field  was  used.37  The  water  molecules  were  assigned  to  the  TIP5P  water 
model38  to  ensure  correct  orientation  of  hydrogen  bonding.  The  ensemble  of  structures 
from  each  simulation  was  examined,  and  while  the  simulation  generated  no  new 
structures  where  n=l-3,  the  simulations  produced  two  novel  CS2(H20)4  configurations. 

The  MP2(full)/6-3 1G*  geometries  from  the  G3  output  were  used  as  input 
structures  for  optimizations  and  calculations  with  the  MP2/aug-cc-pVDZ  model.39  Single 
point  energy  calculations  were  then  carried  out  using  the  aug-cc-pVTZ  basis  set  on  the 
double  zeta  geometries.  MP2/aug-cc-pVTZ  geometry  optimizations  were  performed  on 
the  two  CS2(H20)  and  three  of  the  CS2(H20)2  clusters.  Anharmonic  frequencies40  of  the 
lowest  energy  dimer  were  calculated  at  the  MP2/aug-cc-pVDZ  level.  Scaled  harmonic 
frequencies  of  the  most  likely  clusters  to  be  observed  in  a  low-temperature  molecular 
beam  experiment  were  also  calculated,  using  a  scale  factor  of  0.9604  for  the  MP2/aug-cc- 
pVDZ  minima.41 


Results 

We  observed  significant  differences  between  the  G3  model  chemistry  and 
MP2/aug-cc-pVDZ  and  MP2/aug-cc-pVTZ  results.  MP2/aug-cc-pVDZ  and  MP2/aug-cc- 
pVTZ  calculations  obtained  a  structure  for  CS2(H20)  similar  to  the  single  experimental 
structure  found  by  Ogata  and  Lovas,32  while  the  G3  calculations  gave  a  very  different 
lowest-energy  structure.  Figure  1  exemplifies  this  by  showing  how  the  HF  geometry 
optimized  structure  for  a  CS2(H20)2  cluster  in  step  1  of  the  G3  calculation  changes 
slightly  in  step  3  of  the  G3  procedure,  which  is  an  MP2(full)/6-3 1 G*  optimization.  More 
importantly,  the  G3  MP2(full)/6-31G*  geometry,  labeled  CnG3,  changes  dramatically 
once  the  basis  set  is  increased.  The  MP2/aug-cc-pVDZ  structure  that  results,  labeled 
B„mp2,  is  significantly  different  than  the  MP2(full)/6-31G*  starting  structure.  This  is  a 
general  problem  with  all  of  the  minima  located  by  these  different  methods;  the  MP2/aug- 
cc-pVDZ  structures  vary  significantly  from  the  MP2(full)/6-31G*  structures,  which 
affects  the  thermochemistry  results.  Selected  optimizations  at  the  MP2/aug-cc-pVTZ 
level  for  n=l  and  n=2  structures  support  the  aug-cc-pVDZ  structures.  The  MP2(full)/6- 
3 1G*  structures  generated  as  part  of  the  G3  model  chemistry  can  be  found  in  figures  Sl- 
S4  in  the  supplementary  information,42  while  the  MP2/aug-cc-pVDZ  structures  are 
presented  in  figures  2-5.  We  found  two  dimers,  six  trimers,  six  tetramers,  and  two 
pentamers.  The  structures  are  named  to  show  order  of  stability,  number  of  waters,  and 
optimization  method.  The  letters  A-F  in  the  figures  signify  stability,  with  A  being  the 
most  stable  according  to  the  free  energy  at  298  K.  The  subscript  Roman  numeral  signifies 
the  number  of  waters  in  the  complex,  and  the  superscripts  G3  and  MP2  describes  the 
method  used  to  determine  the  structure  (G3:  MP2(full)/6-31G*  optimization;  MP2: 
MP2/aug-cc-pVDZ  optimization).  The  thermochemical  values  for  these  structures  can  be 


found  in  Table  1.  Table  2  contains  the  free  energies,  equilibrium  constants,  and  number 
of  complexes  per  cubic  centimeter  predicted  to  be  present  in  the  atmosphere  on  a  humid 
warm  day.  This  assumes  a  concentration  of  CS2  of  3.17  x  10'12  mobL'1  (1.91  x  109 
molecules'cm’3)3  and  a  concentration  of  water  of  1.54  x  10’3  mobL'1  (9.30  x  1017 
molecules*cm"3), 17,43  and  a  temperature  of  298  K.  Scaled  harmonic41  and  anharmonic 
frequencies  for  structure  A|MP2  can  be  found  in  Table  3.  Tables  4-7  contain  the  MP2/aug- 
cc-pVDZ  scaled  harmonic  frequencies  for  the  lowest  energy  clusters;  those  that  are  most 
likely  to  be  observable  in  a  low-temperature  molecular  beam  experiment.  Table  8 
identifies  the  strongest  modes  from  these  tables,  modes  which  do  not  show  significant 
overlap  with  water  clusters.  This  table  should  aid  experimentalists  in  identifying  these 
clusters  in  their  laboratories. 


Table  I.  AEo,  AH°298,  and  AG°298,  for  Incremental  Association  Energies  using  G3  and  MP2  Model  Chemistries  at  298.15  K.* 


G3 

MP2/aug-cc-pVDZb 

MP2/aug-cc-pVTZc 

n 

Structure 

AEo 

AH°298 

AG°298 

Structure 

AEo 

AH°298 

AG°298 

AEo 

AH°298 

AG°298 

(A)  CS2(H20)n.,  +  (H20)  ■ 

-CS2(H20)n: 

Incremental  Association  Energetics 

1 

A|G3 

-0.86 

-0.40 

0.87 

A,mp2 

-0.22 

0.22 

4.17 

0.02 

0.45 

4.41 

B,03 

3.83 

-0.50 

3.21 

b,MP2 

1.38 

1.54 

6.01 

1.52 

1.68 

6.15 

2 

A  03 

An 

-3.79 

-3.92 

3.43 

A„MP2 

0.36 

1.03 

3.80 

0.64 

1.31 

4.09 

n  G3 

D|I 

-5.03 

-5.31 

3.90 

R  MP2 

Dll 

-1.01 

-1.38 

5.97 

-0.74 

-1.11 

6.24 

p  03 

G|| 

-3.33 

-3.48 

4.46 

n  MP2 

G|| 

-1.01 

-1.58 

6.80 

-0.69 

-1.26 

7.12 

n  03 

-4.20 

-4.48 

4.68 

n  MP2 

L>ii 

-0.99 

-1.56 

6.83 

-0.67 

-1.24 

7.15 

r  G3 
till 

-4.57 

-4.91 

4.72 

C  MP2 

D|| 

0.27 

-0.22 

7.74 

0.40 

-0.09 

7.87 

r  G3 

ni 

-0.95 

-0.78 

6.01 

P  Mp2 
t*u 

0.30 

-0.42 

8.58 

0.53 

-0.19 

8.81 

Gn  03 

-0.75 

-0.17 

6.03 

n  MP2 

Gn 

2.85 

3.02 

8.73 

2.97 

3.14 

8.85 

3 

A„,G3 

-8.96 

-10.07 

0.48 

A  MP2 

Am 

-2.79 

-4.03 

8.05 

-2.48 

-3.72 

8.26 

R  03 

Dm 

-7.77 

-8.24 

1.72 

R  MP2 

Dill 

-4.78 

-7.06 

8.20 

-4.31 

-6.59 

8.67 

p  G3 
'-'111 

-7.51 

-8.64 

1.78 

n  MP2 
'-III 

-3.80 

-6.11 

9.06 

-3.49 

-5.80 

9.37 

n  03 

Dm 

-7.51 

-8.67 

2.50 

n  MP2 

Dill 

-3.48 

-5.33 

9.50 

-3.22 

-5.07 

9.76 

c  G3 

Dill 

-3.61 

-3.66 

3.89 

C  Mp2 

Uni 

-2.82 

-4.71 

9.78 

-2.51 

-4.39 

10.09 

c  G3 

Till 

-0.46 

0.21 

4.40 

f„,MP2 

-3.38 

-5.74 

10.67 

-2.83 

-5.19 

11.22 

4 

A  03 

Aiv 

-10.10 

-10.79 

-0.99 

A,vMP2 

-5.76 

-7.81 

5.93 

-5.36 

-7.41 

6.33 

n  03 

D|V 

-9.96 

-10.68 

-0.40 

R  MP2 

D|V 

-5.20 

-7.28 

7.07 

-4.80 

-6.88 

7.47 

*A11  values  are  given  in  kilocalories  per  mole. 

bG3  output  served  as  input  for  MP2/aug-cc-pVDZ  optimizations  and  frequency  calculations. 
cMP2/aug-cc-pVDZ  output  served  as  input  for  MP2/aug-cc-pVTZ  single  self-consistent  field  calculations. 


Table  2.  Free  Energies*,  Equilibrium  Constants,  and  Number  of 
Complexes  per  Cubic  Centimeter  Predicted  in  the  Atmosphere  at  298. 1 5 


K. 


n  Structure 

AG°298 

K. 

Kc 

molec'cm'1 

CS2(H20)n.,  +  H20  - 

CS2(H20)n 

1  A,mp2 

4.41 

5.89  x  lO  4 

1.44  x  10'2 

4.25  x  104 

B,mp2 

6.15 

3.10  x  10'5 

7.59  x  10'4 

2.24  x  103 

Total 

4.48  x  104 

A  MP2 

2  An 

4.09 

1.01  x  10'3 

6.06  x  10 1 

1.79  x  106 

n  Mp2 
t>n 

6.24 

2.67  x  10’5 

1.60  x  lO  2 

4.72  x  104 

n  MP2 

t-11 

7.12 

6.03  x  10‘6 

3.61  x  10‘3 

1.06  x  104 

n  MP2 

Ull 

7.15 

5.71  x  10'6 

3.42  x  10  3 

1.01  x  104 

C  MP2 

Cll 

7.87 

1.69  x  10'6 

1.01  x  10'3 

2.99  x  103 

V  MP2 
rii 

8.81 

3.49  x  10'7 

2.09  x  10*4 

6.17  x  102 

r  MP2 

G|| 

8.85 

3.26  x  10'7 

1.95  x  10‘4 

5.76  x  102 

Total 

1.86  x  106 

“Free  energies  (kcal/mol)  calculated  with  MP2/aug-cc-pVTZ//MP2/aug-cc-pVDZ. 


Table  3.  MP2  Scaled  Harmonic  and  Anharmonic  Frequencies*  for  the 


CS2(H;0)  Structure  A, MP2. 


vibrational 

modeh 

MP2/aug-cc-pVDZ 

Harmonic 

intensities 

scaled" 

anharm 

IR 

R  Exp 

v»(a) 

3772 

3736 

m 

s 

vi(a) 

3645 

3616 

w 

vs 

v2(a) 

1559 

1575 

m 

m 

Vj(b) 

1532 

1576 

vs 

w 

v,(b) 

640 

648 

w 

vs 

in-plane  y>(b) 

364 

395 

w 

w 

out-of-plane  v^(b) 

363 

394 

w 

w 

rock 

95 

68 

m 

w 

wag 

88 

88 

s 

w 

intermol  stretch 

81 

73 

w 

w  67d 

out-of-plane  bend 

29 

40 

w 

m 

in-plane  bend 

24 

22 

w 

m 

*A11  values  in  cm1. 

bMode  motion:  Vi  for  symmetric  stretching,  v2  for  bending,  and  V3  for  asymmetric  stretching,  (a)  signifies  the  water  and  (b)  signifies 
the  carbon  disulfide. 

"Scaling  factor  for  MP2/aug-cc-pVDZ  was  0.9604.  Reference 41 . 
dReference  32. 


Table  4.  MP2/aug-cc-pVDZ  Scaled  Harmonic  Frequencies*  for  the 
CS2(H20)2  Structure  BuMP2. 

MP2/aug-cc-pVDZ 

Intensity 

vibrational  modeb 

scaled' 

IR 

v3(a) 

3763 

m 

v3(d) 

3742 

m 

vi(a) 

3638 

m 

v,(d) 

3571 

s 

V2(d) 

1577 

m 

Vz(a) 

1561 

m 

v3(c) 

1533 

vs 

vi(c) 

641 

w 

(o  (O-H  bonded  H  d),  twist  (a) 

586 

m 

in-plane  v^c) 

361 

w 

out-of-plane  V2(c) 

361 

w 

symmetric  p  (d),  a)  (a) 

338 

m 

asymmetric  p  (d),  w  (a) 

176 

s 

to  (S-H  bonded  H  d),  twist(a) 

155 

s 

p  (d),  to  (a) 

147 

m 

twist(a) 

130 

W 

P(  a) 

93 

W 

intermolecular  stretch  (ac) 

67 

W 

intermolecular  stretch  (dc) 

51 

W 

out-of-plane  intermolecular  bend 

20 

W 

in-plane  intermolecular  bend 

17 

W 

mi  values  a ic  in  cm  . 

bMode  motion:  V|  for  symmetric  stretching,  v2  for  bending,  v3  for  asymmetric  stretching,  to  for  wag,  and  p  for  rock,  (a)  signifies 
water  with  an  SO  interaction,  (d)  signifies  the  water  with  an  OH  -S  interaction,  and  (c)  signifies  the  carbon  disulfide. 
cMP2/aug-cc-pVDZ  scaling  factor  was  0.9604. 

Table  5.  MP2/aug-cc-pVDZ  Scaled  Harmonic  Frequencies*  for  the 

CS2(H20)2  Structure  C„MP2. 

MP2/aug-cc-pVDZ  Intensity 

Vibrational  mode 

scaled' 

IR 

v3(d) 

3754 

s 

v3(a) 

3744 

s 

vi(d) 

3622 

m 

vi(a) 

3535 

s 

v2(a) 

1578 

m 

v2(d) 

1559 

m 

v3(c) 

1530 

vs 

vi(c) 

639 

w 

to  (bonded  H  a) 

604 

s 

p  (a),  to  (bonded  H  d) 

414 

m 

in-plane  v^c) 

364 

m 

out-of-plane  V2(c) 

362 

w 

asymmetric  p  (d),  to  (a) 

209 

vs 

to  (free  H  d),  twist  (a) 

176 

m 

intermolecular  stretch(da) 

170 

m 

nonconcerted  to  (free  H  a),  p  (d) 

122 

s 

concerted  to  (free  H  a),  p  (d) 

94 

m 

intermolecular  stretch 

80 

w 

intermolecular  stretch 

75 

m 

out-of-plane  intermolecular  bend 

29 

w 

in-plane  intermolecular  bend 

21 

w 

“All  values  are  in  cm'1. 

bMode  motion:  V|  for  symmetric  stretching,  v2  for  bending,  v3  for  asymmetric  stretching,  to  for  wag,  and  p  for  rock,  (a)  signifies 

water  with  an  S -  0  interaction,  (c)  signifies  the  carbon  disulfide,  and  (d)  signifies  the  water  with  an  OH  -  C  interaction. 

cMP2/aug-cc-pVDZ  scaling  factor  was  0.9604. 

Table  6.  MP2/aug-cc-pVDZ  Scaled  Harmonic  Frequencies*  for 


^^n2u»  ouu^iuic  mu 

vibrational  mode 

MP2/aug-cc-pVDZ 

Intensity 

scaledc 

IR 

Vj(n) 

3736 

s 

v3(a) 

3730 

m 

v3(d) 

3713 

m 

v3(  bonded  H  d/n) 

3489 

vs 

v3(bonded  H  d/an) 

3479 

vs 

V|  (bonded  H  dan) 

3422 

m 

v2(dan) 

1595 

m 

v^dn/a) 

1569 

m 

vi(d/an) 

1567 

s 

v3(c) 

1538 

vs 

a)  (dan) 

836 

w 

V|(C) 

642 

w 

w  (an) 

636 

vs 

a)  (da) 

551 

s 

P  (da) 

437 

s 

in-plane  v^(c) 

357 

m 

out-of-plane  V2(c) 

353 

w 

p  (dn/a) 

342 

m 

p  (dan) 

334 

m 

a)  (dna) 

240 

m 

a)  (an/d) 

215 

m 

heavy  atom  ring  stretch 

211 

m 

heavy  atom  ring  distortion 

193 

m 

heavy  atom  ring  distortion 

174 

m 

heavy  atom  ring  distortion 

173 

w 

Intermolecular  stretch(ce) 

68 

w 

Intermolecular  bend(ce) 

44 

w 

torsion(e) 

39 

w 

twist(e) 

17 

w 

intermolecular  bend(ce) 

9 

w 

*A11  values  are  in  cm'1 

hMode  motion:  Vi  for  symmetric  stretching,  v2  for  bending,  v3  for  asymmetric  stretching,  10  for  wag,  p  for  rock,  and  /  separates  non- 
concerted  motions,  (a)  signifies  the  water  with  an  SO  interaction,  (d)  signifies  the  water  with  OH-S  interaction,  (n)  signifies  the 
water  with  no  direct  interaction  with  CS2,  (c)  signifies  the  carbon  disulfide,  and  (e)  signifies  the  water  trimer. 
cMP2/aug-cc-pVDZ  scaling  factor  was  0.9604. 
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hMode  motion:  V|  for  symmetric  stretching,  v2  for  bending,  v3  for  asymmetric  stretching,  co  for  wag,  p  for  rock,  and  /  separates  non- 
concerted  motions,  (al)  signifies  the  water  with  C-0  interaction,  (a2)  signifies  the  water  with  S— O  interaction,  (c)  signifies  the 
carbon  disulfide,  (dl)  signifies  the  water  that  donates  to  al,  (d2)  signifies  the  water  that  donates  to  a2,  and  (f)  signifies  the  water 
tetramer. 

cMP2/aug-cc-pVDZ  scaling  factor  was  0.9604. 


Table  8:  Intense  MP2/aug-cc-pVDZ  Scaled  Infrared  Absorptions  for  CS2(H20)n  Clusters 
Predicted  to  be  Present  and  Observable  in  a  Molecular  Beam  Low-Temperature 
Experimental  Apparatus.8 
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Figure  1.  Structures  shown  left  to  right:  HF/6-31G*  optimized  structure  from  step  1  of  the  G3  model 
chemistry,  MP2(full)/6-31G*  optimized  structure  from  step  3  of  the  G3  model  chemistry  (CnG3),  and 
MP2/aug-cc-pVDZ  optimized  structure  (BnMP2).  The  HF  structure  was  used  as  input  for  the  MP2(full) 
optimization  that  produced  CnG3,  and  the  MP2(full)  CnG3  structure  was  used  as  input  that  produced 
the  BnMP2  structure. 
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Figure  2.  MP2/aug-cc-pVDZ  geometries  of 
CS2(H20). 
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Figure  3.  MP2/aug-cc-pVDZ  geometries  of 
CS2(H20)2. 
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Figure  4.  MP2/aug-cc-pVDZ  geometries  of 
CS2(H20)3. 


Figure  5.  MP2/aug-cc-pVDZ  geometries  of 
CS2(H20)4. 


Discussion 

Structures. 

Two  types  of  structures  were  found  for  CS2(H20):  a  linear  alignment  of  the  heavy 
atoms  found  in  A|MP2  and  BG3,  and  the  T-shape  found  in  B|MP2  and  AiG3.42  The  linear 
geometry  is  similar  to  the  experimental  structure,  obtained  via  Fourier  transform 
microwave  spectroscopy.32  In  the  experimental  structure  the  S  -0  distance  is  3.197  A, 
while  the  MP2/aug-cc-pVDZ  distance  is  3.1 16  A  and  the  MP2/aug-cc-pVTZ  distance  is 
3. 1 19  A.  Unlike  the  experimental  structure,  the  MP2  structures  have  the  hydrogen  atoms 
in  the  same  plane  as  the  heavy  atoms.  In  the  experiment,  the  S--0-H  angle  is  37  degrees, 
which  was  determined  assuming  C2V  symmetry  of  the  complex,  no  perturbation  of  the 
individual  monomers,  and  that  the  S-C-S—O  geometry  is  linear.  Two  spectroscopic 
constants,  B  and  C,  were  determined  from  the  microwave  spectrum.  The  values  of 
1030.1 109  MHz  and  1026.2912  MHz  are  modeled  fairly  well  by  the  MP2/aug-cc-pVDZ 
results,  which  are  1029.83  MHz  and  1027.40  MHz.  The  value  of  A  was  assumed  to  be 
366.4  GHz  because  it  was  the  best  value  for  the  least-squares  fitting  procedure,  although 
the  authors  comment  that  for  a  C2V  planar  equilibrium  structure  that  they  expected  the  A 
rotational  constant  to  be  equal  to  437  GHz  (the  B  rotational  constant  for  water).32  The 
MP2/aug-cc-pVDZ  value  for  A  is  428  GHz.  The  experimental  dipole  moment  is  2.078 
Debye,  while  the  calculated  result  is  2.802  Debye. 

The  linear  structure  using  the  G3  method,  AiG3,  is  not  the  theoretical  global 
energy  minimum,  whereas  the  global  minimum  for  the  MP2/aug-cc-pVDZ  method  is  the 
linear  structure,  A|MP2.  The  Ai°3  structure,  (Figure  SI  in  supplementary  information),  is 
qualitatively  similar  to  the  BiMP2  structure  displayed  in  Figure  2.  While  qualitatively  the 
structures  are  similar,  the  two  methods  produce  different  bond  lengths  and  angles.  In  the 


T-shaped  geometries,  the  S-0  distances  for  Ai°3  and  BiMP2  are  2.972  A  and  2.754  A, 
respectively.  In  addition  to  the  different  interaction  distances,  the  S— H-0  angle  in  the  T- 
shaped  structures  is  125°  for  the  MP2(full)/6-31G*  geometry  and  156°  for  the  MP2/aug- 
cc-pVDZ  structure.  The  addition  of  diffuse  functions  in  the  geometric  optimizations 
account  for  the  structural  differences,  and  for  the  reversal  of  stability,  by  significantly 
altering  the  enthalpy  and  entropy  of  interaction. 

The  six  structures  for  CS2(H20)2  were  composed  of  either  the  low  energy  water 
dimer26  or  two  separate  water  molecules.  The  G3  MP2(full)/6-31G*  optimized 
geometries  underwent  significant  structural  changes  upon  the  addition  of  diffuse 
functions  to  the  basis  set.  Structures  An03  and  CnG3  (Figure  S2)  each  contained  water 
dimers,  one  perpendicular  to  the  linear  CS2  molecule  and  the  other  extending  away  from 
one  of  the  sulfur  atoms.  After  MP2  optimization  with  diffuse  functions,  these  waters 
reoriented  to  form  the  five-membered  ring  structures  CnMP2  and  BnMP2  (Figure  3).  The 
global  minima  for  both  methods  are  similar  to  their  respective  global  minima  for 
CS2(H20).  An03  (Figure  S2)  consists  of  a  four-membered  ring  with  an  extra  dangling 
water,  and  AnMP2  is  a  linear  structure  with  a  S— O  interaction  on  each  end  similar  to  that 
of  AiMP2.  The  diffuse  functions  also  affected  the  orientation  of  the  free  (non-interacting) 
hydrogens  in  the  MP2/aug-cc-pVDZ  optimizations.  For  the  AnMI  2,  BnMP2,  Q|MP‘,  and 
F„MP2  structures,  a  free  hydrogen  of  one  or  more  water  molecules  is  in  the  plane  defined 
by  the  heavy  atoms. 

Optimizations  of  CS2(H20)3  resulted  in  two  common  formations  of  the  water 
molecules:  a  linear  water  trimer  and  a  cyclic  water  trimer,  similar  to  the  (H20)3  global 
energy  minimum.26,28  In  AmG3,  the  cyclic  water  trimer  spans  across  CS2  to  create  a  S- -O 


van  der  Waals  interaction  with  each  S  atom.  The  global  minimum  from  MP2/aug-cc- 
pVDZ  calculations,  AmMP2,  contains  a  linear  water  trimer  bound  to  one  sulfur  atom.  The 
C„MP2  structure  is  composed  of  the  water  trimer  poised  above  CS2.  In  G3  methodology 
the  cyclic  water  trimer  is  a  key  feature  of  these  complexes’  stability  as  three  of  the  four 
lowest  energy  structures  contain  the  cyclic  water  trimer.  In  the  MP2/aug-cc-pVDZ 
calculations,  however,  the  cyclic  water  trimer  no  longer  appears  in  the  global  minimum 
for  CS2(H20)3.  In  the  structures  without  cyclic  water  trimers,  the  MP2/aug-cc-pVDZ 
optimizations  placed  some  non-interacting  hydrogens  in  the  plane  with  the  heavy  atoms 
of  the  complex,  as  shown  in  AmMP2,  DhiMP2,  and  EmMP2.  The  G3  calculations,  however, 
show  no  tendency  towards  a  planar  orientation  of  hydrogen  in  any  structures.  Instead, 
geometries  such  as  E  mG3  and  FmG3  (Figure  S3)  have  all  non-interacting  hydrogens  out  of 
the  plane  of  the  heavy  atoms. 

CS2(H20)4  shows  two  nearly  identical  structures  optimized  with  G3  and 
MP2/aug-cc-pVDZ.  All  structures  contained  the  cyclic  S4  water  tetramer28  centered  over 
one  of  the  sulfur  atoms.  The  AiVmp2  minimum  has  one  of  the  tetramer’s  hydrogen  bonds 
passing  over  CS2.  The  C— O  interaction  distance  in  this  geometry  is  3.382  A.  The  B|VMP2 
structure  has  one  water  of  the  tetramer  above  the  CS2,  with  an  S  -  H  interaction  distance 
of  3.450  A. 

Thermochemistry. 

Calculations  for  CS2(H20)n,  where  n=l-4,  gave  very  different  thermochemical 
results  from  the  two  methods,  G3  and  MP2/aug-cc-pVDZ.  The  differences  are  clear  in 
Table  1  as  some  nearly  identical  structures  have  very  different  thermochemistry.  The  G3 
results  have  to  be  discounted,  as  the  first  optimization  in  the  G3  method  uses  the  HF/6- 


3 1G*  method  for  the  frequency  information  essential  for  the  zero-point  and  entropic 
corrections,  while  the  second  uses  the  MP2(fulI)/6-31G*  method.  These  two  structures 
are  often  different  from  each  other,  and  as  shown  in  the  previous  section  the  lack  of 
diffuse  functions  in  the  second  geometry  optimization  produces  quite  different  structures 
compared  to  MP2/aug-cc-pVDZ  method.  Single  point  energy  calculations  with  the  triple 
zeta  basis  set  on  the  double  zeta  basis  set  geometries  show  a  clear  pattern,  giving 
confidence  that  further  increases  of  the  basis  set  would  lead  to  energy  convergence. 

The  values  in  Table  1  show  that  the  combination  of  a  water  molecule  with  a 
carbon  disulfide  molecule  to  form  the  most  stable  complex  is  only  slightly  endothermic; 
including  entropy  results  in  a  4.4  kcalmol'1  free  energy  for  formation  of  this  complex  at 
298  K.  Addition  of  another  water  to  the  A  iMP2  structure  to  form  the  A  uMP2  structure 
requires  a  free  energy  of  4.1  kcal  mol'1  at  298  K.  Building  on  the  A  nMP2  structure  to  form 
the  A  iiiMP2  structure  is  exothermic  by  3.7  kcal  mol'1  because  of  hydrogen  bonding 
between  the  waters,  but  requires  a  free  energy  of  8.3  kcal  mol'1  because  of  the  entropic 
cost  of  ordering  the  three  water  molecules  on  one  side  of  the  carbon  disulfide  molecule. 
Adding  one  more  water  to  produce  the  A|vMP2  structure  is  exothermic  by  7.4  kcal  mol1, 
and  requires  6.3  kcal  mol'1  of  free  energy.  The  S4  water  tetramer  is  quite  stable,  releasing 
much  heat,  but  at  the  cost  of  greater  entropy.28 

We  note  that  more  accurate  thermochemical  values  could  be  obtained  by  using  the 
MP2/aug-cc-pVDZ  geometries  and  frequencies  as  starting  points  for  a  G3-like 
calculations,  or  by  calculating  single-point  energies  at  the  CCSD(T)/aug-cc-pVnZ  levels 
of  theory.  In  addition,  corrections  for  Basis  Set  Superposition  Error  would  improve  the 
thermochemical  values.  Such  calculations  would  improve  the  absolute  numbers  but  most 


likely  not  change  the  trends,  so  that  the  MP2/aug-cc-pVTZ  values  effectively  capture  the 
minimum  energy  structures. 

Based  on  the  MP2/aug-cc-pVTZ  free  energies  for  forming  the  complexes  in  Table 
1 ,  the  concentration  of  saturated  water  in  the  atmosphere,  and  the  average  amount  of  CS2 
in  the  atmosphere,  the  concentrations  of  the  different  dimer  and  trimer  clusters  are 
predicted  in  Table  2.  We  predict  that  under  these  conditions  at  298  K,  that  104  CS2(H20) 
clusterscm'3  and  106  CS2(H20)2  clusterscm'3  are  present.  Under  the  same  conditions,  it 
has  been  predicted  that  the  total  number  of  water  dimers,  trimers,  tetramers,  pentamers, 
and  hexamers  is  on  the  order  of  1014,  1012,  10",  1010,  and  104  clusters  cm'3  respectively.17 
Frequencies. 

Table  3  contains  the  MP2/aug-cc-pVDZ  anharmonic  and  scaled  harmonic 
frequencies  for  the  AiMP2  CS2(H20)  cluster.  The  anharmonic  frequencies  are 
computationally  expensive,  and  it  is  reassuring  that  there  is  fairly  good  agreement 
between  the  scaled  harmonic  and  anharmonic  frequencies  in  the  intermolecular  infrared 
region  below  1000  cm'1.  Ogata  and  Lovas  used  FT  microwave  spectroscopy  to  determine 
the  intermolecular  stretch  of  the  CS2(H20)  cluster  at  67  cm'1.  This  frequency  corresponds 
to  the  calculated  values  of  81  cm'1  (scaled  MP2)  and  73  cm'1  (anharmonic  MP2)  in  Table 
3.  The  strongest  intensity  bands,  according  to  the  MP2  scaled  harmonic  data,  are  at  1532 
and  88  cm'1.  These  two  modes  correspond  to  the  asymmetric  stretch  of  carbon  disulfide 
(red-shifted  by  10  cm'1)  and  a  wagging  motion  of  the  intermolecular  complex.  The  88 
cm'1  peak  is  in  a  unique  region  of  the  spectrum  relative  to  water  clusters.31  There  is  a 
weak  intensity  vibration  for  the  water  tetramer  predicted  at  66  cm'1,  and  a  few  weak 
peaks  below  50  cm'1  for  the  water  pentamer,  and  all  of  the  other  intermolecular  modes  for 


the  water  dimer,  trimer,  tetramer,  and  pentamer  are  predicted  to  be  above  140  cm'1.  Thus 
we  expect  that  this  peak  could  be  observed  experimentally. 

Based  on  the  values  for  AEo  in  Table  1,  which  approximate  the  values  for  AGsk  in 
a  cold  molecular  beam,  we  would  expect  the  A|,  Bn,  Cn,  Cm,  and  Aiv  species  to 
predominate  in  a  low-temperature  experiment.  Tables  4-8  contain  the  scaled  harmonic 
frequencies  for  these  clusters.  Each  vibrational  mode  is  described  in  each  table,  along 
with  the  predicted  frequency  and  relative  IR  intensity.  We  have  previously  calculated 
scaled  harmonic  and  anharmonic  frequencies  for  water  clusters  from  the  dimer  up  to  the 
pentamer,31  and  we  have  contrasted  this  data  with  the  data  in  tables  4-8  to  determine  the 
most  likely  region  of  the  infrared  spectrum  to  observe  the  CS2(H20)n  clusters  in  a 
laboratory  experiment.  Focusing  on  the  intermolecular  region,  which  has  the  least  overlap 
with  water  clusters  and  the  most  reliable  predicted  scaled  harmonic  frequencies,  we  have 
listed  the  most  likely  frequencies  to  observe  these  clusters  in  Table  8.  The  CnMP2  structure 
has  a  strong  intensity  mode  that  can  best  be  described  as  a  non-concerted  wag  of  the  non¬ 
interacting  hydrogen  atom  of  the  water  making  an  OH— S  interaction  and  a  rocking 
motion  of  the  water  making  an  OH— C  interaction  at  122  cm'1.  This  frequency  lies  in 
between  the  A|MP2  wag  at  88  cm'1  and  the  medium  intensity  water  dimer  mode  at  142  cm' 
’,  and  should  be  detectable  in  a  laboratory  experiment.  There  are  a  few  weak  modes  in  the 
128-136  cm'1  range  for  the  water  pentamer,31  but  at  low  water  pressures  the  Cu  frequency 
should  be  much  stronger.  The  CmMP2  structure  has  a  strong  intensity  wagging  motion  of 
the  water  that  makes  the  OH— S  interaction  and  the  water  that  makes  the  S— O  interaction 
at  437  cm’1.  This  vibration  is  in  a  relatively  clear  part  of  the  water  cluster  IR  spectrum, 
and  the  closest  possible  obfuscation  of  this  peak  would  be  from  a  predicted  medium 


intensity  water  trimer  peak  at  426  cm'1.31  Carbon  disulfide  has  a  weak  intensity  scaled 
MP2/aug-cc-pVDZ  bending  mode  at  354  cm'1,  so  the  Cm  complex's  frequency  is  blue- 
shifted  by  approximately  80  cm1.  The  AivMP2  structure  has  four  modes  that  have  the 
potential  to  be  observed  in  the  laboratory.  Four  different  wagging  motions  involving 
various  parts  of  the  intermolecular  complex  are  predicted  to  be  strong  or  very  strong 
intensity  peaks  at  245,  252,  723,  and  787  cm'1.  Carbon  disulfide  has  a  weak  intensity 
scaled  MP2/aug-cc-pVDZ  symmetric  stretch  at  643  cm'1,  so  the  Aiv  complex's  higher 
frequencies  are  red-shifted  by  approximately  80  and  145  cm'1.  The  water  trimer  has  a 
weak  intensity  peak  predicted  at  229  cm'1,  the  water  pentamer  has  two  weak  intensity 
peaks  predicted  at  239  and  240  cm'1,  and  a  medium  intensity  peak  predicted  to  be  appear 
at  727  cm'1.31  Variation  of  water  pressure  and  observation  of  the  225-275  and  700-800 
cm  1  regions  of  the  spectrum  may  allow  for  the  detection  of  the  Aiv  cluster. 

Methodology. 

Despite  the  failures  in  the  present  work  on  van  der  Waals  clusters  of  CS2  and 
water,  the  G3  model  chemistry  is  an  excellent  predictor  of  structures  and 
thermochemistry  for  pure  water  clusters  and  ion/water  clusters.17  26'31  G3  does  not  use 
diffuse  functions  in  its  geometric  optimizations,  an  inherent  weakness  for  calculations  of 
van  der  Waal's  clusters  governed  by  London  dispersions  forces.  The  HF/6-3 1G*  and 
MP2(full)/6-31G*  optimizations  cannot  describe  the  long-range  interactions  with  the 
third  period  sulfur  atom.  The  G3  thermochemical  calculations  include  various  correcting 
factors  for  diffuse  functions,  polarization  functions,  and  other  effects,  which  are  all  based 
on  the  MP2(full)/6-31G*  optimized  structure.25  As  the  MP2(full)/6-31G*  global 
minimum  for  the  G3  calculations  does  not  match  the  linear  experimental  structure  for 


CS2(H20),  or  the  MP2/aug-cc-pVDZ  structures  for  the  CS2(H20)n  larger  clusters,  this 
model  chemistry  is  not  suitable  for  van  der  Waals  complexes  with  sulfur,  particularly 
CS2.  We  note  that  a  G3-like  calculation  could  be  performed  starting  with  the  MP2/aug- 
cc-pVDZ  geometry  and  frequencies,  which  would  most  likely  yield  very  good  results. 

Conclusion 

We  have  performed  an  extensive  molecular  dynamics  search  for  structures  of 
CS2(H20)n,  where  n=l-4,  locating  two  dimers,  six  trimers,  six  tetramers,  and  two 
pentamers  with  the  G3  model  chemistry  and  with  MP2/aug-cc-pVDZ  geometry 
optimizations.  The  MP2/aug-cc-pVDZ  results  match  best  with  the  experimental  result  for 
the  CS2(H20)  dimer,  showing  that  diffuse  functions  are  necessary  to  model  the 
interactions  found  in  this  complex.  The  MP2(full)/6-31G*  global  minimum  of  CS2(H20) 
in  the  G3  calculations  does  not  match  the  linear  experimental  structure  for  CS2(H20),  or 
the  MP2/aug-cc-pVDZ  structure.  In  addition,  for  the  larger  CS2(H20)„  clusters,  the 
MP2/aug-cc-pVDZ  minima  are  significantly  different  from  the  MP2(full)/6-3 1G* 
structures.  We  conclude  that  the  G3  model  chemistry  is  not  suitable  for  investigation  of 
van  der  Waals  complexes  with  sulfur.  Based  on  the  MP2/aug-cc-pVTZ  free  energies,  the 
concentration  of  saturated  water  in  the  atmosphere,  and  the  average  amount  of  CS2  in  the 
atmosphere,  the  concentrations  of  CS2(H20)  and  CS2(H20)2  clusters  are  predicted  to  be 
on  the  order  of  104  CS2(H20)  clusterscm'3  and  106  CS2(H20)2  clusters  cm  3  at  298  K.  The 
most  abundant  clusters  at  298  K  are  not  necessarily  the  most  abundant  found  in  a  cold 
molecular  beam,  and  we  predict  that  the  AiMP2,  CuMP2,  CmMP2,  and  AivMP2  structures 
should  be  observable  in  the  intermolecular  infrared  frequency  ranges. 
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It  has  been  speculated  that  the  presence  of  0H(H20)„  clusters  in  the  troposphere  could  have  significant  effects 
on  the  solar  absorption  balance  and  the  reactivity  of  the  hydroxyl  radical.  We  have  used  the  G3  and  G3B3 
model  chemistries  to  model  the  structures  and  predict  the  frequencies  of  hydroxyl  radical/water  clusters 
containing  one  to  five  water  molecules.  The  reaction  between  hydroxyl  radical  clusters  and  methane  was 
examined  as  a  function  of  water  cluster  size  to  gain  an  understanding  of  how  cluster  size  affects  the  hydroxyl 
radical  reactivity. 


Introduction 

The  hydroxyl  radical  is  the  most  important  reactive  species 
in  the  troposphere.  Described  as  the  atmospheric  “vacuum 
cleaner,"  the  hydroxyl  radical  is  responsible  for  many  of  the 
reactions  that  remove  volatile  organic  compounds  (VOCs)  from 
the  air.1  The  hydroxyl  radical  has  a  short  lifetime  in  the 
atmosphere  and  is  regenerated  constantly.  It  has  a  global  average 
concentration  of  106  molecules*cm-3  during  the  daylight  hours, 
dropping  at  night  once  photodissociation  pathways  that  generate 
OH  cease.1  Because  the  hydroxyl  radical  acts  as  a  major  sink 
for  many  VOCs,  its  concentration  influences  the  balance  of 
many  atmospheric  species.  For  example,  this  radical  oxidizes 
approximately  83%  of  annual  methane  emissions,  supporting 
the  fact  that  the  hydroxyl  radical  is  one  of  the  most  important 
processors  of  greenhouse  gases.  Methane  concentration  is 
increasing  in  the  atmosphere  because  anthropogenic  sources  are 
more  than  double  natural  methane  sources,  and  sources  out¬ 
number  sinks  by  35-40  Tg*yr-1J 

Because  it  is  known  to  form  singly  hydrated  complexes,2  3 
the  role  of  the  0H(H20)  dimer  and  higher  order  clusters  has 
been  speculated  to  have  an  effect  on  atmospheric  chemistry. 
The  formation  of  hydrogen  bonds  is  known  to  affect  the  spectral 
features  and  reactivity  of  the  constituent  monomers.4  Cluster 
formation  results  in  spectral  peak  broadening,  peak  shifts,  and 
the  addition  of  intermolecular  vibrational  modes  that  occur 
below  1000  cm-1.  The  0H(H20)  dimer,  recently  observed  by 
rotational  spectroscopy,2,3  is  proposed  as  an  intennediate  in  the 
interconversion  of  OH  and  H20,5  and  has  been  predicted  to  be 
a  stronger  oxidant  than  free  OH.6  The  presence  of  0H(H20)„ 
clusters  could  have  an  effect  on  the  solar  absorption  balance, 
resulting  from  the  presence  of  new  absorption  bands  (<1000 
cm-1)  added  to  the  atmospheric  spectrum.  Additionally,  the 
larger  dipole  moment  of  the  complex  enhances  the  intensity  of 
absorption.7  In  a  recent  review  article,  Schrems  and  co-workers 
argue  that  both  the  hydroxyl  radical’s  solar  absorption  balance 
and  reactivity  could  be  greatly  affected  by  complexation  with 
water.4  We  have  undertaken  this  study  to  better  understand  the 
role  that  OH(H20)„  clusters  could  have  on  atmospheric  absorp¬ 
tion  and  reactivity  with  VOCs. 
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Methods 

The  G3  model  chemistry,8  used  previously  to  accurately 
model  structures  and  energetics  for  gas-phase  water  clusters 
composed  of  2— 8  water  molecules,9-12  ion— water  clusters, 13,14 
and  hydroperoxy  radical— water  complexes,15  has  been  used  to 
model  the  hydroxyl  radical— water  complexes.  Clusters  were 
built  in  SPARTAN,16  optimized  using  the  PM3  method,17  and 
followed  by  HF/6-31G*  self-consistent  field  optimizations.18 
These  structures  served  as  input  for  G3  calculations  performed 
using  Gaussian03  versions  B.02  and  C.02.19  Harmonic  frequency 
calculations  were  performed  on  all  dimer,  trimer,  tetramer, 
pentamer,  and  hexamer  structures  using  the  HF/6-31G*  method. 
These  frequencies  were  scaled  by  0.8929  to  obtain  reliable 
frequency  estimates  for  the  lowest  energy  structures  because 
scaled  HF/6-31G*  frequencies  have  previously  been  shown  to 
be  in  good  agreement  with  MP2  anharmonic  and  experimental 
values  for  water  clusters.20  To  understand  the  effects  of  clusters 
on  reactivity,  a  simple  reaction  between  hydroxyl  radical  clusters 
and  methane  was  examined.  Reactants,  transition  states,  and 
products  were  calculated  using  the  G3B3  method21  because  only 
one  geometry  optimization  is  required  with  this  model  chemistry. 
In  addition  to  the  model  chemistry  calculations,  the  BHandll- 
LYP  DFT  functional  was  used  with  the  6-31 1G**  and  aug-cc- 
pVTZ  basis  sets  to  calculate  geometries  and  harmonic  frequen¬ 
cies  for  the  methane  and  OH  radical  reaction  pathway.  A  set  of 
single  self-consistent  field  calculations  at  the  CCSD(T)/aug- 
cc-pVTZ  level  were  made  on  the  BHandHLYP/aug-cc-pVTZ 
geometries.  Kinetic  rate  constants  were  obtained  using  ThcRate 
on-line  program22  using  simple  transition-state  theory  with  and 
without  Eckart  tunneling.18,23,24 

Results 

The  free-energy  minima  at  298  K  for  the  dimer,  trimer, 
tetramer,  pentamer,  and  hexamer  are  presented  in  Figure  1.  The 
relevant  O— H  bond  lengths  and  hydrogen  bond  lengths,  angles, 
and  other  geometric  parameters  can  be  found  in  the  Supporting 
Information.  In  addition,  the  remaining  local  minima  for  OH- 
(H20)„  n  =  2—5  clusters  can  be  found  in  the  Supporting 
Information.  Table  1  contains  the  G3  energetics  in  kcabmol-1 
for  the  association  of  cyclic  water  clusters  with  a  hydroxyl 
radical  and  for  the  association  of  hydroxyl  radical- water  clusters 
with  one  additional  water  molecule.  Our  reported  energetics 
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Figure  1.  G3  free-energy  minima  at  298  K  for  OH(H20)„  clusters,  n 
=  1-5. 


include  the  electronic  energies  of  binding  corrected  for  zero- 
point  energy  (AZTo),  the  same  energies  including  thermal 
corrections  (A£°298),  and  the  enthalpies  (A/T^s)  and  Gibbs  free 
energies  (AG°29»)  at  298  K.  Reaction  i  of  Table  1  gives  the 
association  energetics  for  the  formation  of  a  hydroxyl  radical- 
water  cluster  from  a  pure  water  cluster  and  free  hydroxyl  radical. 
Reaction  ii  of  Table  1  gives  the  energetics  for  the  sequential 
addition  of  water  molecules  to  hydroxyl  radical— water  clusters. 

The  equilibrium  constants  for  a  standard  state  of  1  M  (Ac), 
molarity  (A/),  and  the  number  of  clusters/cm3  predicted  to  be 
present  in  the  lower  troposphere  on  a  humid  day  are  presented 
in  Table  2.  These  values  are  based  on  the  Gibbs  free  energy 
for  the  association  of  the  most  stable  0H(H20)„-i  structure  with 
each  successive  water  (Table  1,  reaction  ii).  Table  3  contains 
the  results  for  the  DFT  calculations  and  for  the  CCSD(T) 
calculations  for  the  reaction  of  OH  and  CH4,  from  starting 
reactants  to  final  products,  including  pre-reactive  and  post¬ 
reactive  complexes,  as  well  as  the  transition  state  for  the 
reaction.  Figure  2  illustrates  the  two-dimensional  slice  of  the 
potential  energy  surface  for  this  reaction  using  the  data  in  Table 
3.  Table  4  contains  AEo,  AE°298,  A//°298,  and  AG°298  obtained 
using  the  G3B3  method  for  the  reaction  of  methane  with  the 
OH  radical,  and  the  low  energy  Civ  dimer,  cyclic  trimer,  cyclic 
tetramer,  cyclic  pentamer,  and  cyclic  hexamer.  This  table  also 
contains  the  G3B3  free  energy  of  activation,  AG°298*,  for  the 
abstraction  of  a  hydrogen  from  methane  by  the  hydroxyl  radical 
and  by  0H(H20)„  clusters  and  rate  constants.  Table  5  contains 
scaled  frequency  data  for  the  0H(H20)  dimer  and  OHfFUOh 
cyclic  trimer. 


Discussion 

In  the  0H(H20)  dimer  global  minimum,  OH  is  the  hydrogen- 
bond  donor.2’3,25-34  The  electronic  state  in  the  G3  structure  has 
the  radical  electron  in  the  in-plane  2A'  configuration,  which  is 
recognized  as  the  global  minimum  for  OH^O).2’3-28’33-34  A 
second  electronic  state  is  close  by,  with  the  radical  electron  in 
the  out-of-planc  orbital,  which  corresponds  to  the  higher  energy 
2 A"  state.26,28’32-34  The  excited  state  lies  ^200  cm-1  above  the 
ground  state.2  The  hydrogen-bond  length  for  the  ground  state 
of  1.912  A  compares  well  with  the  microwave  values  of  1.952 

A2  and  1 .945  A.3  The  transition  state  for  the  0H(H20) — H - 

CH3  structure  is  also  in  the  2A'  electronic  state,  and  the 
transition-state  structure  clearly  shows  that  the  radical  electron 
that  is  in-plane  is  forming  the  bond  with  the  hydrogen  that  is 
being  abstracted  from  methane  (Supporting  Information). 

The  low-energy  structures  for  the  0H(H20)„  trimer,  tetramer, 
pentamer,  and  hexamer  clusters  at  298  K  are  cyclic,  which  are 
similar  to  the  analogous  pure  water  clusters.10’12  A  previous 
study  by  Hamad,  Lago,  and  Mejias  presents  structures  for  the 


dimer  through  hexamer  using  the  UMP2,  BLYP,  and  BHLYP 
methods.29  They  found  similar  structures  for  MP2  energy 
minima  for  the  dimer,  trimer,  and  tetramer  but  different  energy 
minima  for  the  pentamer  and  hexamer.  Mejias  and  co-workers’ 
calculations  using  the  BHLYP  method  reveal  similar  energy 
minima  for  all  cluster  sizes  except  the  pentamer.29  Cabral  do 
Couto  and  co-workers  used  microsolvation  modeling  and 
statistical  mechanics  simulations  to  obtain  0H(H20)„  structures 
where  n  =  1— 6.31  Using  the  MPWlPW91/aug-cc-pVDZ 
method,  they  find  low-energy  structures  that  are  similar  to 
Mejias  and  co-workers’  findings.  For  the  trimer  through 
hexamer,  the  low-energy  structure  is  cyclic,  with  the  free 
hydrogen  on  the  water  molecules  alternating  in  an  up  (u),  down 
(d)  orientation.  Conversely,  Mejias  finds  the  cyclic  trimer  to 
have  water  molecules  with  an  up,  up  (u,u)  orientation  and  the 
cyclic  tetramer  to  have  an  up,  down,  down  (u,d,d)  orientation. 
Schenter  and  co-workers  have  used  CCSD(T)/aug-cc-pVTZ 
potential  scans  of  the  0H(H20)  potential  energy  surface  to 
devise  an  interaction  potential  for  the  ground  and  excited  states 
of  the  OH  radical— H2O  system.7  Their  interaction  potential 
predicts  cyclic  structures  as  the  lowest  energy  conformers  for 
the  trimer  through  pentamer,  and  the  cage  structure  as  the  most 
stable  hexamer.7 

By  our  G3  calculations,  the  A E  energy  minima  at  0  K  are 
the  ud  cyclic  trimer,  udu  cyclic  tetramer,  cyclic  pentamer,  and 
cage  hexamer  (Table  1).  This  agrees  with  previous  MPW1PW91 
DFT  work.31  The  same  structures  are  found  as  minima  after 
adding  thermal  energy  to  obtain  AE°29S,  with  the  exception  of 
the  hexamer,  for  which  the  prism  is  now  the  lowest  energy 
structure.  Gibbs  free-energy  calculations  at  298  K  reveal  the 
most  stable  configurations  at  that  temperature.  Formation  of  the 
uu  cyclic  trimer,  uud  cyclic  tetramer,  cyclic  pentamer,  and  cyclic 
hexamer  have  the  lowest  AG°  values  at  298  K.  These  structures, 
with  the  exception  of  the  cyclic  pentamer,  are  all  different  from 
the  lowest  energy  structures  determined  from  A E°.  This  is  a 
common  occurrence  with  water  clusters  because  the  zero-point 
corrected  energies  yields  the  right  order  for  the  free-energy 
surface  for  these  clusters  in  a  cold  (5—20  K)  molecular  beam, 
while  raising  the  temperature  reorders  the  free-energy  profile.11’12 

Hydrogen  bond  lengths  shorten  as  the  cluster  size  increases 
with  the  longest  bond  length  for  the  OH  to  OH2  hydrogen  bond 
(Supporting  Information).  The  OH  to  OH2  hydrogen  bond  length 
is  longest  for  the  dimer  at  1.9  A,  shortens  to  1 .8  A  for  the  trimer, 
1.73  A  for  the  tetramer,  1.71  A  for  the  pentamer,  and  1.70  A 
for  the  hexamer.  The  HOH  to  OH  hydrogen  bond  also  shortens 
as  cluster  size  increases.  This  suggests  that  the  OH/water 
electron  network  strengthens  with  additional  waters.  The  same 
trend  of  bond  lengths  can  be  seen  in  pure  water  clusters.10  As 
cluster  size  increases  for  0H(H20)„,  n  =  2—5,  the  hydrogen 
bond  angle  approaches  180°.  The  OIKIHO)  dimer  has  a 
hydrogen  bond  angle  of  173.5°,  which  compresses  in  the  more 
constrained  trimer,  then  increases  to  nearly  180°  in  the  cyclic 
pentamer  and  hexamer.  Brauer  et  al.  have  reported  that  the  spin 
dipolar  constants  of  the  dimer  increase  by  about  33%  upon 
complexation  while  the  Fermi  contact  term  changes  from 
—73.25  MHz  in  free  OH  to  —155.3  MHz  in  the  complex.2 
Although  complicated  to  interpret  for  a  multielectron  effect,  they 
state  unequivocally  that  the  large  change  in  the  OH  magnetic 
hyperfine  constants  upon  complexation  leads  to  a  substantial 
change  in  the  electron  distribution  of  the  radical  upon  com¬ 
plexation.2  Similarly,  Ohshima  and  co-workers  have  concluded 
from  their  microwave  results  that  electron  density  is  transferred 
from  the  water  to  the  OH  radical  upon  formation  of  the 
hydrogen-bonded  complex.3  Our  results  lead  us  to  suggest  that, 
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TABLE  1:  G3  Energetics  in  kcal-mol  1  for  the  (i)  Association  of  Cyclic  Water  Clusters  with  Hydroxyl  Radical,  and  (ii) 
Incremental  Association  of  Hydroxyl  Radical-Water  Clusters  with  an  Additional  Water  Molecule 


(i)  (H20)„  +  OH  -  HO(H20)„ 

(ii)  HO(H20), 

+  H20  -  HO(H20)„ 

n 

A£0 

A  E°  298 

A//°  298 

0 

O 

< 

298 

i 

ii 

i 

ii 

i 

ii 

i 

ii 

1 

-3.97 

-3.97 

-3.90 

-3.90 

-4.49 

-4.49 

1.14 

1.14 

2Ud 

-6.93 

-6.16 

-7.58 

-6.59 

-8.17 

-7.18 

1.56 

2.36 

2UU 

-6.83 

-6.05 

-7.32 

-6.33 

-7.91 

-6.92 

1.37 

2.17 

3pud 

-4.84 

-5.17 

-4.51 

-4.83 

-5.10 

-5.42 

1.41 

1.74 

3uud 

-8.79 

-9.12 

-8.64 

-8.97 

-9.24 

-9.56 

-1.74 

-1.38 

3udu 

-9.03 

-9.35 

-9.04 

-9.36 

-9.63 

-9.95 

-1.48 

-1.14 

4S4+OH 

-5.39 

-6.20 

-5.32 

-6.18 

-5.91 

-6.77 

2.18 

2.39 

4D2d 

-4.87 

-5.68 

-4.74 

-5.60 

-5.33 

-6.19 

2.48 

2.70 

4udu-Hl20 

-2.54 

-3.36 

-2.03 

-2.90 

-2.62 

-3.49 

4.10 

4.32 

4pentamcr 

-5.39 

-7.57 

-5.32 

-7.29 

-7.02 

-7.88 

-0.23 

-0.01 

^hexamcr 

-5.84 

-6.32 

-5.57 

-5.97 

-6.16 

-6.56 

-3.30 

0.83 

5  book 

-4.56 

-5.04 

-4.56 

-4.97 

-5.16 

-5.56 

-0.55 

3.58 

5pcn(amcr+OM 

-4.29 

-4.77 

-4.34 

-4.75 

-4.94 

-5.34 

-0.60 

3.53 

5c«gc 

-6.27 

-6.75 

-6.81 

-7.21 

-7.40 

-7.80 

-0.55 

3.58 

5prism 

-5.87 

-6.34 

-6.33 

-6.73 

-6.92 

-7.32 

-0.13 

3.99 

TABLE  2:  Gibbs  Free  Energies  (Standard  State  of  1  M)  in  kcahmol-1,  Equilibrium  Constants  (Standard  State  of  1  M), 
Molarity  (A/),  and  Number  of  Hydroxyl  Radical  Clusters  per  Cubic  Centimeter  (A)  Predicted  to  be  Present  in  the  Lower 
Troposphere  when  the  Water  Concentration  is  0.001544  M,  and  the  OH  Concentration  is  1.66058  x  10  14  M,  at  298  K 


reaction 

AG°298 

Kc 

M 

N 

OH  +  H:0  —  HO(H20) 

1.14 

3.56 

9.1  X  10-'7 

5.5  X  I0J 

HO(H20)  +  H20  —  H0(H20)2 

2.17 

0.631 

6.4  x  10"20 

4  x  10' 

H0(H20)  2  +  H20  —  H0(H20)3 

-1.38 

251 

2.4  x  lO"20 

1.4  x  10' 

HO(H2Oh  +  H20  —  H0(H20)4 

-0.0144 

25.1 

9.1  x  10"22 

5.5  x  10"' 

H0(H20)4  +  H20  —  HO(H20)s 

0.828 

6.05 

8.5  x  10"24 

5  x  10"' 

TABLE  3:  Change  in  Electronic  Energy,  Gibbs  Free  Energy,  Enthalpy,  Entropy,  and  Activation  Energy  for  the  Formation  of 

CH3  and  HzO  from  OH  and  CH4fl 

AE0 

A//°  298 

AG°298 

A  S°  298 

E*h 

BHandHLYP/aug-cc-pVTZ* 

reactants  —  pre-reactive  complex 

0.20 

0.72 

1.43 

-7.85 

reactants  —  transition  state 

8.36 

7.37 

13.30 

-20.71 

8.56 

reactants  —  post-reactive  complex 

-10.04 

-9.44 

-7.50 

-8.34 

reactants  —  product 

-9.83 

-9.42 

-12.23 

9.51 

CCSD(T)/aug-ec-pVTZc 

reactants  — *  pre-reactive  complex 

-0.15 

0.37 

1.09 

-2.39 

reactants  —  transition  state 

4.51 

3.53 

9.45 

-19.87 

4.72 

reactants  —  post-reactive  complex 

-14.21 

-13.61 

-11.68 

-6.50 

reactants  — *  product 

-13.46 

—  13.05 

-15.86 

9.43 

pre-reactive  complex  —  transition  state 

4.66 

3.16 

8.36 

-17.48 

transition  state  — •  post-reactive  complex 

-18.73 

-17.14 

-21.13 

13.37 

post-reactive  complex  —  product 

0.75 

0.56 

-4.18 

15.92 

experimental  data 

reactant  — *  transition  state4' 

3.89 

5.08 

47  Interaction  energy  given  in  kcabmol  and  the  change  in  entropy  given  in  cal*mol 

‘•Kelvin  '.  A£0  includes  zero-point  vibrational  energy. 

h  Activation  energy  determined  using  the  equation  £a 

=  +  2RT. 

c  Single  self-consistent  field  calculation  performed  on  BhandHLYP/aug- 

cc-pVTZ  optimized  geometries.  (1  Reference  42. 


similar  to  pure  water  clusters,  the  cooperativity  of  hydrogen- 
bond  formation  distributes  the  partial  atomic  charge  throughout 
the  cluster.  This  effect  for  homodromic  cyclic  water  clusters 
was  first  explained  by  Xantheas,35  and  can  be  seen  in  homo¬ 
dromic  cyclic  water  structures  for  trimers,  tetramers,  pentamers, 
and  octamers.11’12  The  cyclic  structures  that  display  cooperativity 
have  their  individual  water  dipoles  building  upon  each  other  in 
the  plane  of  the  cycle,  resulting  in  more  negative  charge  on  the 
oxygens  and  more  positive  charge  on  the  hydrogens  in  coopera¬ 
tive  complexes.12  The  partial  atomic  charges  on  OH,  H2O,  and 
OH(H20)  derived  from  the  electrostatic  potential  at  the  UMP2- 
(full)/6-31G*  level  reveal  that  the  water  hydrogens  become  more 
positive  by  0.014  atomic  charge  in  the  complex,  the  hydroxyl 


hydrogen  becomes  more  positive  by  0.050  atomic  charge,  and 
the  hydroxyl  oxygen  becomes  more  negative  by  0.074  atomic 
charge. 

The  concentration  of  clusters  in  the  atmosphere  at  298  K  is 
estimated  in  Table  2,  for  a  water  vapor  concentration  of 
0.001544  M  and  a  hydroxyl  concentration  of  1.66058  x  10-14 
M.  The  equilibrium  constant,  Kc,  (1  M  standard  state)  is 
calculated  from  A G°  (1  atm  standard  state)  values  from  Table 
1  (ii).  This  table  shows  the  incremental  change  in  free  energy 
for  an  existing  OH(H20)„  cluster  upon  incorporating  another 
water  molecule  into  the  complex,  and  the  abundances  of  specific 
clusters.  The  most  abundant  of  the  clusters,  OUff^O),  is 
predicted  to  have  a  concentration  on  the  order  of  104 
molecules*cm“3  on  a  warm  humid  day.  The  dimer  concentration 
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Reactants  Pre-Reactive  Transition  Post -Reactive  Products 

Intermediate  State  Intermediate 


Figure  2.  Two-dimensional  view  of  the  potential  energy  surface  for  the  reaction  of  OH  +  CH4  ~ 1 -  H20  +  CH3  using  the  CCSD(T)/aug-cc-pVTZ// 
BHandHLYP/aug-cc-pVTZ  results.  Energies  in  kcahmol-1  and  entropies  in  kcal-mol-l*K_l. 

TABLE  4:  G3B3  Change  in  Energy,  Enthalpy,  Gibbs  Free  Energy,  and  Activation  Energy  with  Calculated  Transition-State 
Theory  Rate  Constants  at  298  K  for  the  Formation  of  a  CH3(H20)„f  1  Cluster  from  0H(H20)„  and  CH4fl 


CH4  +  0H(H20)„  —  CH3+(H20)„+, 


n 

A£0 

AE°  298 

A  H°  298 

AG°298 

AG°;W» 

k*> 

If 

0 

-13.4 

-14.5 

-14.0 

-12.5 

9.7 

3.9  X  I014 

2.0  x  10-|J 

1 

-13.1 

-13.7 

-13.7 

-8.2 

13.7 

2.4  x  10-'6 

8.7  x  10-'6 

2 

-14.8 

-15.8 

-15.4 

-9.1 

10.6 

3.0  x  I0-'5 

1.2  x  10"14 

3 

-14.3 

-15.4 

-14.9 

-8.2 

10.5 

1.3  x  10~15 

6.9  x  10" 15 

4 

-14.5 

-15.4 

-15.1 

-7.4 

12.7 

8.9  x  10-'7 

4.1  x  10"'6 

5 

-14.0 

-14.9 

-14.6 

-7.3 

11.7 

3.5  x  10-'6 

1.5  x  lO'15 

u  Interaction  energies  are  reported  in  kcal-mol  and  the  rate  constants  are  reported  with  units  of  cm3-molccule  Ls  h  Rate  constant  determined 
without  proton  tunneling.  c  Rate  constant  determined  with  Eckart  proton  tunneling. 


is  on  the  same  order  of  magnitude  as  the  water  hexamer,  which 
would  be  difficult  to  detect  in  the  troposphere.9  However,  this 
does  not  rule  out  the  possibility  that  the  0H(H20)M  clusters  could 
have  an  important  role  in  atmospheric  chemistry.  This  is  an 
equilibrium  calculation  based  on  an  initial  population  for  the 
hydroxyl  radical  of  107  radicals/cm3.  If  local  conditions  produced 
a  lOOOth-fold  increase  in  the  OH  concentration,  then  the  OH- 
(H2O)  cluster  concentration  would  increase  to  107  clusters'cm-3. 
Measurement  of  OH  concentrations  are  complicated,36,37  and 
researchers  are  continually  trying  to  improve  modeling  and 
experimental  methods  for  accurate  determination  of  this  im¬ 
portant  radical.38-42  There  is  evidence  for  substantial  variation 
of  the  concentration  of  hydroxyl  radicals  over  the  past  several 
decades.41  Field  measurements  and  modeling  studies  differ 
between  observation  and  predictions  of  OH  concentration  in 
the  atmosphere,  and  the  importance  of  weak  electronic  absorp¬ 
tion  features  in  contributing  to  significant  OH  production  has 
been  demonstrated.43  On  the  basis  of  the  currently  accepted 


values  for  the  abundances  of  OH,  and  our  thermodynamic 
results,  we  do  not  expect  clusters  of  0H(H20)W  to  be  abundant 
in  the  atmosphere. 

To  put  the  reactivity  of  0H(H20)„  clusters  in  context,  consider 
Table  3,  which  illustrates  the  reactivity  of  the  clusters  by 
displaying  the  values  for  removing  a  hydrogen  from  methane. 
The  oxidation  of  methane  by  the  hydroxyl  radical  is  one  of  the 
two  major  sources  of  water  in  the  stratosphere,  the  other  being 
water  injection  from  the  troposphere.44  Experimentally,  the 
activation  barrier  ( E a)  for  the  reaction 

CH4  +  OH  —  CH3  +  H20 

is  5.08  kcabmol-1,  and  when  corrected  for  2RT results  in  a  A IP* 
of  3.89  kcabmol-1.45  The  CCSD(T)/aug-cc-pVTZ  method  yields 
a  A//°*298  of  3.5  kcabmol-1  and  an  Ea  of  4.7  kcabmol-1,  in 
agreement  with  the  experimentally  observed  activation  barrier. 
The  G3B3  method  is  also  in  good  agreement,  producing  values 
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TABLE  5:  Vibrational  Modes,  Frequencies  in  cm-1,  and 
Relative  IR  Intensities  for  the  0H(H20)  Dimer  and 
0H(H20)2  Cyclic  Trimer  from  HF/6-31G*  Scaled  Harmonic 
Frequencies0 


mode 

frequency 

IR  intensity 

V3 

OH(H20) 

3738 

m 

V| 

3636 

w 

O— H  stretch 

3517 

s 

vz 

1624 

m 

out-of-plane  u) 

541 

s 

in-plane  a) 

395 

s 

heavy  atom  stretch 

173 

w 

r  (H20),  (OH) 

135 

w 

(o  (H20) 

122 

s 

V3  (D) 

OH(H20)2 

3717 

m 

v3  (A) 

3711 

m 

v,  (D) 

3596 

m 

v,  (A) 

3563 

s 

OH  stretch 

3439 

s 

v2c  (AD) 

1651 

m 

v2nc  (A/D) 

1635 

m 

coc  (ADOH) 

767 

m 

o>nc  (AD/OH) 

550 

s 

a)nc  (A/OH) 

459 

s 

rnc  (A/D) 

407 

s 

pc  (AD),  to  (OH) 

336 

m 

pnc  (A/D) 

257 

m 

heavy  atom  str  (AOH) 

202 

w 

distortion 

170 

w 

a)  (A,  free  H) 

166 

m 

heavy  atom  str  (DOH) 

136 

w 

a)  (AD,  free  H) 

100 

m 

°  The  relative  intensity  for  infrared  absorption  (IR)  are  characterized 
as  weak  (w;  <  40  KM/molc),  medium  (m;  60—160  KM/mol),  or  strong 
(s;  >  195  KM/mol),  and  the  scale  factor  is  0.8929.  Key  to  mode 
motion:  c  for  concerted,  nc  for  nonconcerted,  co  for  wag,  p  for  rock, 
and  r  for  twist.  The  “plane”  refers  to  the  mirror  symmetry  plane  of  the 
dimer.  For  the  trimer,  D  and  A  stand  for  the  donor  and  acceptor  waters, 
respectively. 

for  A//°298*  of  3.8  kcahmol"1  and  for  AG°298*  of  9.7  kcabmol-1. 
A  two-dimensional  slice  of  the  potential  energy  surface  for  this 
reaction  is  shown  in  Figure  2.  This  curve  was  generated  by 
taking  the  energy  at  each  point  relative  to  the  preceding  point, 
considering  the  forward  reaction  above  and  is  also  reported  in 
Table  3.  The  curve  can  be  described  by  three  different  types  of 
energies:  zero-point  vibrationally  corrected  electronic  energy 
(AE0),  enthalpy  (A//°298),  and  Gibbs  free  energy  (AG°298).  A 
pre-reactive  and  post-reactive  intermediate  are  formed  on  the 
AE0  surface,  whereas  only  the  post-reactive  intermediate  is  seen 
on  the  A//°298  surface.  Formation  of  the  pre-reactive  complex 
from  the  reactants  has  an  cntropic  cost,  as  does  formation  of 
the  transition-state  structure  from  the  pre-reactive  complex. 
Formation  of  the  post-reactive  intermediate  from  the  transition- 
state  structure  has  a  favorable  entropy  change,  as  does  produc¬ 
tion  of  the  final  products  from  the  post-reactive  intermediate. 
At  room  temperature  and  with  entropy  taken  into  account, 
neither  intermediate  is  a  stationary  point  on  the  AG°298  surface. 
The  free  energy  of  activation  at  room  temperature  is  the  free 
energy  for  the  transition  from  the  reactants  to  the  transition  state, 
1 .09  +  8.36,  or  9.45  kcabmol  (Figure  2).  As  the  temperature  is 
lowered  toward  zero  Kelvin,  the  A/7°298  and  AG°298  surfaces 
converge  to  the  AEo  surface.  At  very  low  temperatures,  as  found 
in  a  molecular  beam,  it  would  be  possible  to  observe  both  the 
pre-  and  post-intermediates  of  this  reaction. 

Given  the  expense  of  the  CCSD(T)  calculations,  we  found 
that  the  model  chemistry  G3B3  method  was  able  to  provide 
reasonable  results  at  a  lower  cost,  as  seen  in  comparing  Table 
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4  to  Table  3.  As  shown  in  the  table,  the  hydration  of  OH  actually 
reduces  the  Gibbs  free  energy  released  by  the  reaction,  as  well 
as  increasing  the  Gibbs  activation  energy.  In  general,  water 
clusters  reduce  the  oxidative  ability  of  OH  with  respect  to 
methane.  Thus,  it  appears  that  any  water  clusters  that  form  about 
the  hydroxyl  radical  will  reduce,  not  enhance,  the  rate  of  reaction 
of  OH  with  organic  species  in  the  atmosphere.  We  attribute 
this  effect  to  the  enhanced  stability  of  OH(H20)„  clusters  as 
the  number  of  waters,  n,  increases.  As  each  successive  water  is 
added  to  make  the  most  stable  cluster,  the  OH  to  OH2  hydrogen 
bond  strengthens,  as  does  the  OH/water  electron  network.  This 
result  contradicts  an  earlier  conclusion  based  on  semiempirical 
quantum  mechanical  calculations.6  Karakus  and  Ozkan  have 
shown  that  the  difference  in  electronic  energy  between  transi¬ 
tion-state  structures  and  reactants  for  the  reaction  of  OH(H20) 
abstracting  a  hydrogen  from  alkanes  decrease  as  the  hydrocarbon 
chain  is  increased  from  one  (methane)  to  three  (propane).46 

The  experimental  rate  constant  CH4  +  OH  — *  CH3  +  H20 
has  been  determined  by  multiple  research  groups  for  temperature 
ranges  from  1240  to  190  K.47-52  The  most  recent  experimental 
determination  of  the  rate  constant  is  6.29  ±  0.18  x  10-15  to 
6.8  ±  0.14  x  10~15  cm3•molecule“1•s-,  at  298  K.53  A 
comprehensive  review  by  Atkinson  summarizes  the  methods 
and  rate  constants  obtained  from  experiments  from  1962  to 
2002,  and  he  recommends  a  value  of  6.40  x  10-15  cm3*mole- 
cule~us_1.52  Using  the  G3B3  model  chemistry  results  and 
TheRate  on-line  program,  we  determined  rate  constants  for  this 
reaction  to  be  3.9  x  10-14  cm3*molccule“1,s_l  and  2.0  x  10“13 
cm3,molecule"1*s~1  when  including  tunneling  in  the  rate 
constant  calculation.  Both  of  these  values  are  approximately 
10  and  100  times  larger  than  the  experimental  value.  In  1993, 
Melissas  and  Truhlar  used  MP2-SAC2  model  chemistry  results 
and  interpolated  canonical  variational  transition-state  theory  with 
tunneling  and  obtained  rate  constants  at  298  K,  4.12—5.48  x 
10“ 15  cm3#molecule“1#s_1,  results  that  are  remarkably  close  to 
today’s  accepted  values.23,54 

We  can  draw  some  conclusions  from  Table  4  if  we  consider 
that  the  error  in  the  rate  calculation  is  systematic  and  we  only 
examine  the  relative  rates  as  a  function  of  hydration  extent  of 
the  hydroxyl  radical.  Considering  the  tunneling-corrected  rate 
constant,  the  hydration  of  the  hydroxyl  radical  decreases  the 
rate  by  10-  to  1000-fold,  the  slowest  rate  obtained  being  the 
hydration  of  the  radical  by  a  single  water  molecule.  As  discussed 
above,  we  predict  that  the  OH(H20)„  cluster  will  have  an 
abundance  of  5.5  x  104  molecules  in  the  lower  atmosphere, 
and  a  1000-fold  decrease  in  its  reaction  with  methane. 

The  hydroxyl  OH  stretch  of  the  dimer  has  been  observed  in 
an  argon  matrix  at  1 1.5  K  and  assigned  to  bands  at  3452  and 
3428  cm-1.28  Later  work  that  coupled  these  experiments  with 
highly  accurate  anharmonic  oscillator  local-mode  calculations 
revealed  three  bands,  at  3452.2,  3428.0,  and  3442.1  cm-1.  These 
three  sites  most  likely  stem  from  different  sites  within  the  matrix, 
and  not  from  different  structures,  and  their  scaled  model  gives 
a  single  value  of  3479.0  cm-1.33  Our  gas-phase  value  is  3517 
cm-1,  well  within  the  perturbations  induced  by  the  argon 
matrix.20  For  spectroscopic  detection  of  OH(H20)„  clusters,  we 
will  focus  on  those  vibrations  that  are  unique  and  have  strong 
intensities  for  the  dimer  and  trimer  because  larger  clusters  are 
probably  not  abundant  enough  for  detection.  We  have  shown 
previously  that  HF/6-31G*  scaled  frequencies  for  water  clusters 
have  standard  deviations  of  20— 25  cm-1  compared  to  experi¬ 
mental  and  to  MP2  anharmonic  frequencies.20  According  to  the 
scaled  HF/6-31G*  frequencies,  the  hydroxyl  radical  has  a 
vibration  at  3569  cm-1.  The  OH(H20)  dimer  has  an  OH  stretch 
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red  shifted  from  the  OH  monomer  by  52  cm  "1,  whereas  the 
0H(H20)2  trimer’s  OH  stretch  is  red  shifted  from  the  dimer  by 
78  cm1.  Because  of  this  difference  in  frequencies,  the  trimer 
could  possibly  be  identified  by  IR  spectroscopy  through  the  3439 
cm  '1  OH  stretch.  An  isolated  dimer  could  be  observed  at  its 
3517  cm-1  OH  stretch,  but  this  vibration  would  most  likely  be 
overlapped  by  a  v\  (H20)3  vibration  at  3519  cm-1  and  a  v\ 
(H20)4  vibration  at  3521  cm-1  unless  the  water  concentration 
could  be  reduced  significantly  in  the  IR  experiment.20 

The  intermolecular  vibrational  modes  at  541  cm-1  (an  out- 
of-plane  wag),  395  cm"1  (in-plane  wag),  and  122  cm ”l  (wag) 
are  three  possible  targets  for  detecting  0H(H20).  The  v\ 
vibration  of  0H(H20)2  at  3563  cm-1,  and  intermolecular 
vibrational  modes  at  550,  459,  and  407  cm-1  have  strong  IR 
intensities  that  could  assist  in  the  detection  of  0H(H20)2.  The 
550  cm"1  vibration  is  a  nonconcerted  wag  of  the  acceptor  and 
donor  waters  wagging  out-of-phase  with  the  OH  wagging 
motion.  This  0H(H20)2  peak  could  potentially  be  overlapped 
if  the  water  dimer  (vibration  at  553  cm"1)  or  water  pentamer 
(vibration  at  449  cm-1)  were  present  under  experimental 
conditions.20  A  similar  overlap  with  the  407  cm"1  vibration  of 
OH(H20)2,  a  nonconcerted  twist  of  the  acceptor  water  and  donor 
water,  which  would  be  hidden  by  an  intermolecular  mode  of 
the  water  trimer.  If  such  water  clusters  were  present  in  the 
experimental  setup,  then  the  459  cm”1  vibration  of  0H(H20)2 
(a  nonconcerted  wagging  of  the  acceptor  water  and  hydroxyl 
radical)  is  unique  from  other  clusters,  thus  a  good  target  for 
detection.  Another  possibility  for  observing  these  complexes  is 
to  probe  electronic  transitions.  The  electronic  transition  2X+  — 
2n  occurs  around  308  nin,  and  this  transition  is  predicted  to 
red  shift  by  about  2500  cm" 1  upon  formation  of  the  OH(H20) 
complex.34 

Conclusions 

The  G3  model  chemistry  has  been  used  to  determine  the 
structures,  energetics,  and  frequencies  of  OH(H20)„  clusters. 
The  lowest  electronic  energy  structures  agree  with  previous 
theoretical  results.  Free-energy  calculations  at  298  K  reveal  that 
the  most  likely  structures  to  be  observed  at  room  temperature 
usually  differ  from  the  lowest  temperature  structures.  The  lowest 
free-energy  structures  display  cooperativity  of  hydrogen  bond¬ 
ing,  maximizing  the  negative  partial  atomic  charge  on  oxygens 
and  the  positive  partial  atomic  charge  on  hydrogens  involved 
in  hydrogen  bonding  in  these  structures.  The  OH(H20)  geometry 
is  in  good  agreement  with  the  experimental  microwave  structure. 
Predicted  abundances  of  the  OH(H20)„  clusters  reveal  that  these 
clusters  are  not  predicted  to  be  abundant  in  the  atmosphere. 
Calculation  of  free  energies  of  activation  for  the  abstraction  of 
a  hydrogen  from  methane  predict  that  water  impedes  the  ability 
of  the  hydroxyl  radical  to  oxidize  organic  molecules,  a  result 
that  is  contrary  to  published  hypotheses. 
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ABSTRACT:  A  series  of  CCSD(T)  single-point  calculations  on  MP4(SDQ)  geometries 
and  the  Wl  model  chemistry  method  have  been  used  to  calculate  AH0  and  AG°  values 
for  the  deprotonation  of  17  gas-phase  reactions  where  the  experimental  values  have 
reported  accuracies  within  1  kcal/mol.  These  values  have  been  compared  with  previous 
calculations  using  the  G3  and  CBS  model  chemistries  and  two  DFT  methods.  The  most 
accurate  CCSD(T)  method  uses  the  aug-cc-pVQZ  basis  set.  Extrapolation  of  the  aug-cc- 
pVTZ  and  aug-cc-pVQZ  results  yields  the  most  accurate  agreement  with  experiment, 
with  a  standard  deviation  of  0.58  kcal/mol  for  AG°  and  0.70  kcal/mol  for  AH°. 

Standard  deviations  from  experiment  for  AG°  and  AH°  for  the  Wl  method  are  0.95  and 
0.83  kcal/mol,  respectively.  The  G3  and  CBS-APNO  results  are  competitive  with  Wl 
and  are  much  less  expensive.  Any  of  the  model  chemistry  methods  or  the  CCSD(T)/ 
aug-cc-pVQZ  method  can  serve  as  a  valuable  check  on  the  accuracy  of  experimental 
data  reported  in  the  National  Institutes  of  Standards  and  Technology  (NIST)  database. 
©2006  Wiley  Periodicals,  Inc.  Int  J  Quantum  Chem  106:  3122-3128,  2006 
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Introduction 

The  ability  of  modem  quantum  chemical  meth¬ 
ods  to  calculate  gas-phase  reaction  and  free 
energies  accurately  is  essential  for  a  variety  of 
chemical  predictions,  such  as  the  accurate  a  priori 
calculation  of  gas-phase  equilibrium  constants. 
Computed  AG°  values  must  be  correct  to  within  1 
kcal/mol  to  achieve  chemical  accuracy.  We  have 
recently  evaluated  the  ability  of  the  Gaussian  (G n) 
model  chemistries  [1,  2]  and  the  complete  basis  set 
(CBS)  model  chemistries  [3-5]  to  model  gas-phase 
deprotonation  accurately,  by  examining  their  per¬ 
formance  against  17  reactions  in  the  National  Insti- 
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tutes  of  Standards  and  Technology  (NIST)  database 
[6],  where  the  deprotonation  AG°  is  reported  as 
highly  accurate  (defined  as  an  experimental  error  of 
<1  kcal/mol)  [7].  The  high  computational  cost  of 
the  model  chemistries  led  us  to  evaluate  a  large 
range  of  density  functional  theory  methods  against 
the  same  experimental  database  [8].  Additionally,  a 
search  for  the  most  accurate  methods  available, 
combined  with  a  desire  to  check  the  experimental 
data,  led  us  to  determine  energies  with  the 
CCSD(T)/aug-cc-pVnZ  model  using  MP4(SDQ)  ge¬ 
ometries.  In  the  present  work,  we  report  on  the 
performance  of  CCSD(T)  [9]  and  the  W1  model 
chemistries  [10]  against  this  experimental  data  set 
[6]. 


Methods 

We  used  the  W1  [10]  model  chemistry  and 
CCSD(T)  [9]  (coupled  cluster  with  all  single  and 
double  substitutions  along  with  a  quasi-perturba- 
tive  treatment  of  connected  triple  excitations)  im¬ 
plemented  within  Gaussian  03  [11]  and  Gaussian  98 
[12].  The  coupled  cluster  calculations  included  tri¬ 
ple  excitations  for  both  the  complete  fourth-order 
Moller-Plesset  (MP4)  and  the  CCSD(T)  energies 
(E4T  keyword).  The  coupled  cluster  single-point 
energy  calculations  used  the  augmented  correlation 
consistent  polarized  n-tuple  zeta  basis  sets  (aug-cc- 
pVnZ,  n  =  D,  T,  Q,  5)  of  Dunning  and  coworkers 
[13-15].  These  calculations  were  performed  on  ge¬ 
ometries  obtained  using  MP4  perturbation  theory 
with  single,  double,  and  quadruple  substitutions 
[MP4(SDQ)]  [16].  These  optimizations,  and  their 
corresponding  frequency  calculations,  employed 
the  aug-cc-pVTZ  basis  set.  The  frequency  calcula¬ 
tions  ensured  that  all  structures  were  optimized  to 
a  true  energetic  minima  on  the  potential  energy 
surface  (PES),  and  the  unsealed  thermochemical 
corrections  were  used  to  obtain  the  zero-point  en¬ 
ergy  (ZPE),  enthalpy,  and  Gibbs  free  energy.  Fur¬ 
thermore,  to  estimate  the  energy  at  the  complete 
basis  set  limit,  we  have  performed  a  series  of  two- 
point  extrapolations  on  the  correlation  energy  [10, 
17-19]  In  this  scheme  [Eqs.  (l)-(3)],  an  extrapolated 
value  for  the  correlation  contribution  to  the  total 
energy  is  obtained  using  two  consecutive  correla¬ 
tion  energies,  x  -  1  and  x,  and  is  then  added  to  a 
non-extrapolated  Hartree-Fock  energy: 


x3Ecxon  -  (x  -  l)3Ecx°-r; 

Lx~l'x  X3  -  (x  - 1)3  (  ] 

+  (3) 


Additionally,  we  corrected  for  basis  set  superpo¬ 
sition  error  (BSSE)  in  the  HC1  and  H20  systems, 
using  the  Boys-Bernardi  functional  counterpoise 
scheme  (fCP)  [20]. 

The  W1  model  chemistry  was  developed  to  be  an 
affordable  and  accurate  method  for  the  determina¬ 
tion  of  thermochemistry  [10].  W1  contains  one  mol¬ 
ecule-independent  empirical  parameter  and  is 
based  on  a  series  of  coupled-cluster  calculations, 
using  a  variety  of  robust  basis  sets.  These  calcula¬ 
tions  employ  inner  shell  correlation  contributions, 
scalar  relativistic  corrections,  and  two  separate  two- 
point  extrapolation  schemes  for  both  the  valence 
CCSD  energy  and  valence  (T)  energy  contributions. 

Obtaining  enthalpies  and  free  energies  of  depro¬ 
tonation  requires  the  value  of  H°  and  G°  for  the 
proton.  We  used  the  translational  energy  of  1.5RT 
combined  with  PV  -  RT  (H  =  E  +  PV)  to  obtain  a 
value  of  H°  (H+)  equal  to  5/2 (RT),  or  1.48  kcal/mol. 
We  used  the  Sackur-Tetrode  equation  (see  Ref. 
[21])  to  obtain  the  entropy,  TS(H+)  =  7.76  kcal/mol 
at  298K  and  1  atm  pressure,  which  gives  a  value 
(G  =  H  -  TS)  for  G°  (H+)  of  -6.28  kcal/mol  [22, 
23].  All  values  reported  in  the  present  work  are  for 
a  standard  state  of  1  atm. 


Results 

Tables  I  and  II  compare  the  model  chemistry 
results  for  17  deprotonation  reactions  with  the  ex¬ 
perimental  results  reported  in  the  NIST  database 
[6].  Also  presented  are  previously  computed  ther¬ 
mochemical  results  using  the  CBS-QB3  [5],  CBS- 
APNO  [4,  24],  and  G3  [2]  model  chemistries  [7],  as 
well  as  the  B3P86  and  PBE1PBE  DFT  methods  [8]. 
The  standard  deviation  from  experiment,  a,  is  also 
included  in  Tables  I  and  II.  The  experimental  values 
were  taken  from  the  NIST  database  [6].  Selected 
systems  favored  small  reported  error  bars,  and  re¬ 
cently  published  data  [25-40]. 


Discussion 

MODEL  CHEMISTRIES 


pcorr  _  pCCSD(T)  _  pHF 

X  t-'X  ‘-‘X 


The  W1  and  CBS-APNO  model  chemistry  results 
(1)  have  standard  deviations  of  ~1  kcal/mol  relative  to 
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the  experimental  values,  while  the  computationally 
less  expensive  G3  method  yields  standard  devia¬ 
tions  of  1.27  and  1.17  kcal/mol  for  enthalpy  and 
free  energy  changes.  The  CBS-QB3  method,  which 
uses  DFT  geometries,  has  standard  deviations  of  1.5 
and  1.4  kcal/mol  for  enthalpy  and  free  energy 
changes.  Wl  is  the  only  method  that  has  a  higher 
standard  deviation  for  free  energy  changes  than 
enthalpy  changes,  suggesting  that  the  scaling  of  the 
B3LYP/cc-pVTZ  frequencies  within  that  method  is 
not  optimal  for  free  energies. 

A  comparison  of  the  model  chemistry  results  to 
the  CCSD(T)  results  is  only  possible  for  the  smallest 
10  molecules  at  the  top  of  Table  I  because  of  the 
expense  of  the  coupled  clusters  calculation  scheme. 
The  standard  deviations  for  the  CBS-QB3,  G3,  CBS- 
APNO,  and  Wl  results  of  the  10  atom  set,  <rtop/  are 
1.8,  1.6,  1.3,  and  1.0  for  enthalpies  and  1.6,  1.5,  1.2, 
and  1.2  for  free  energies.  This  is  interesting  for  two 
reasons,  the  first  being  that  the  coupled  cluster 
method  using  a  quadruple-zeta  basis  set  is  signifi¬ 
cantly  more  accurate  than  the  model  chemistries. 
The  second,  which  may  be  more  important  for  us¬ 
ers  of  these  methods,  is  that  the  model  chemistries 
are  more  accurate  for  the  seven  largest  molecules  in 
this  test  set.  As  shown  by  a-bottom,  the  G3,  CBS- 
APNO,  and  Wl  model  chemistries  are  better  for  the 
larger  molecules  which  are  extremely  costly  to  cal¬ 
culate  with  the  CCSD(T)  method. 

The  G3  and  CBS-APNO  methods  model  the  ge¬ 
ometries  and  energies  of  hydrogen-bonded  water 
and  ion-water  clusters  quite  accurately  [41-48],  and 
this  deprotonation  study  adds  to  the  body  of  evi¬ 
dence  that  shows  the  reliability  of  these  two  model 
chemistries.  Furthermore,  the  Wl  method,  which  is 
computationally  more  expensive,  is  also  highly  ac¬ 
curate.  These  three  model  chemistries  are  more  re¬ 
liable  than  DFT  methods  that  use  the  aug-cc-pVTZ 
basis  set  for  energy  calculations  [8],  and  are  com¬ 
parable  with  the  best  CCSD(T)  results  displayed  in 
Tables  I  and  II. 

COUPLED  CLUSTER  CALCULATIONS 

Tables  I  and  II  show  that  the  CCSD(T)/aug-cc- 
pVnZ  method  has  the  best  overall  agreement  with 
experiment  when  n  =  Q.  While  collectively  the  n  = 
5  results  are  worse  than  n  =  Q,  when  the  HC1 
outlier  is  removed  the  accuracy  of  the  two  methods 
are  roughly  equivalent.  Because  the  errors  associ¬ 
ated  with  the  n  =  T  and  Q  calculations  systemati¬ 
cally  decrease  from  T  to  Q,  the  TQ  two-point  ex¬ 
trapolation  results  are  in  excellent  agreement  with 


experiment.  Conversely,  there  are  no  systematic 
changes  in  the  errors  between  the  n  =  Q  and  5 
calculations,  which  is  reflected  in  the  extrapolation 
results.  Ignoring  the  HC1  outlier,  the  calculated  val¬ 
ues  for  enthalpy  and  free  energy  of  reaction  are 
virtually  unchanged  as  the  basis  is  expanded  from 
n  =  Q  to  5.  Excluding  HC1,  recalculation  of  the 
standard  deviation  from  experiment  for  the  Q5  ex¬ 
trapolation  results  in  a  falling  from  1.59  to  0.86  and 
from  1.69  to  1.10  for  free  energy  and  enthalpy  re¬ 
spectively,  a  nearly  equivalent  result  to  the  TQ 
extrapolation.  We  feel  that  ignoring  the  HC1  outlier 
is  justifiable,  particularly  within  the  context  of  our 
extrapolation  scheme  because  the  error  in  the  HC1 
calculated  correlation  energy  increases  as  the  basis 
set  expands,  and  our  two-point  extrapolation 
scheme  amplifies  this  error.  The  failure  of  the  quin- 
tuple-zeta  HC1  calculation  inspired  us  to  perform  a 
very  limited  set  of  fCP  corrections.  We  investigated 
BSSE  errors  for  the  aforementioned  HC1  system,  as 
well  as  the  HzO  system,  which  enjoyed  favorable 
agreement  with  experiment.  As  shown  by  the  val¬ 
ues  in  italics  in  Tables  I  and  II,  correction  for  BSSE 
brings  the  values  for  the  n  =  T,  Q,  and  5  basis  sets 
into  better  agreement,  and  the  extrapolated  TQ  and 
Q5  values  are  now  both  in  excellent  agreement  with 
experiment.  Because  of  the  extreme  expense  of 
these  calculations,  we  were  unable  to  perform  fCP 
corrections  for  all  ten  of  the  systems  studied  with 
coupled  cluster  theory,  and  thus  we  are  unable  to 
determine  whether  this  is  a  fortuitous  result  or  a 
general  trend.  However,  these  results  do  suggest 
that  even  high  correlation  methods  executed  with 
robust  basis  sets  still  benefit  from  fortuitous  error 
cancellation.  Feller  et  al.  [49]  recently  published  a 
careful  study  of  the  sources  of  error  for  electronic 
structure  calculations  on  small  systems.  They  used 
correlation  consistent  basis  sets  ranging  up  to  the 
n  =  10  size  to  show  that  coupled  cluster  calcula¬ 
tions  slowly  converge  on  the  total  energy  as  n  in¬ 
creases.  The  raw  CCSD(T)/cc-pvl0Z  energy  of  the 
Ne  (1S)  atom  remains  0.6  kcal/mol  above  the  esti¬ 
mated  complete  basis  set  limit  [49].  Because  energy 
differences  converge  more  rapidly  than  raw  ener¬ 
gies,  the  bond  dissociation  energy  of  N2  is  con¬ 
verged  to  0.5  kcal/mol  at  the  CCSD(T)/cc-pV7Z 
level  [49].  Using  simple  extrapolation  schemes  ef¬ 
fectively  improves  the  computed  properties  by  an 
equivalent  of  increasing  the  basis  set  size  by  three 
or  more  cardinal  numbers  [49].  This  suggests  that  a 
single-point  calculation  at  the  n  =  7  or  n  =  8  level 
yields  results  similar  to  an  extrapolated  TQ  or  Q5 
scheme.  Additionally,  the  CCSD(T)  method  often 
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produces  results  that  are  in  better  agreement  with 
full  configuration  interaction  than  the  more  expen¬ 
sive  CCSDT  method  [49].  Thus,  we  advocate  the 
use  of  extrapolated  TQ  values,  because  of  their 
relatively  inexpensive,  yet  high  accuracy  predic¬ 
tions.  The  G3,  CBS-APNO,  and  W1  model  chemis¬ 
try  methods  appear  to  be  the  more  efficient  route 
when  calculating  the  thermodynamics  of  deproto¬ 
nation  for  small  organic  molecules. 

Interestingly,  the  coupled  clusters  and  model 
chemistry  results  both  consistently  disagree  with 
certain  experimental  results.  The  NIST  values  for 
AG°  and  AH°  have  been  revised  along  with  the 
error  bars  for  the  experimental  deprotonation  of 
methanol  since  our  first  use  of  this  data  set  [7].  The 
original  error  bars  of  0.6  and  0.4  kcal/mol  in  the 
NIST  database  in  2000  have  been  revised  to  1.1  and 
1.0  kcal/mol,  and  the  results  here  suggest  that  the 
error  bars  are  even  larger,  as  the  value  for  AG° 
should  be  closer  to  376  rather  than  375  kcal/mol. 


Conclusion 

The  model  chemistry  methods  G3,  CBS-APNO, 
and  W1  are  relatively  fast  and  efficient  methods  for 
obtaining  deprotonation  enthalpies  and  free  ener¬ 
gies.  Calculations  at  the  CCSD(T)/aug-cc-pVQZ 
level  are  very  accurate  for  the  17-molecule  test  set. 
Results  are  worse  for  a  coupled  cluster  energy  cal¬ 
culation  using  the  aug-cc-pV5Z  data  set,  stemming 
from  either  better  cancellation  of  errors  for  the  qua- 
druple-zeta  basis  set  or  from  the  effects  of  BSSE. 
Any  of  the  model  chemistry  methods  or  the 
CCSD(T)/aug-cc-pVQZ  method  can  serve  as  a 
valuable  check  on  the  accuracy  of  experimental 
data  reported  in  the  NIST  database. 
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The  hydroperoxy  radical  (HO2)  plays  a  critical  role  in  Earth’s  atmospheric  chemistry  as  a  component  of 
many  important  reactions.  The  self-reaction  of  hydroperoxy  radicals  in  the  gas  phase  is  strongly  affected  by 
the  presence  of  water  vapor.  In  this  work,  we  explore  the  potential  energy  surfaces  of  hydroperoxy  radicals 
hydrogen  bonded  to  one  or  two  water  molecules,  and  predict  atmospheric  concentrations  and  vibrational 
spectra  of  these  complexes.  We  predict  that  when  the  HO2  concentration  is  on  the  order  of  108  molecules'cm-3 
at  298  K,  that  the  number  of  HCV’^O  complexes  is  on  the  order  of  107  molecules*cm  3  and  the  number 
of  H02*#,(H20)2  complexes  is  on  the  order  of  106  molecules-cm-3.  Using  the  computed  abundance  of 
H02— H20,  we  predict  that,  at  298  K,  the  bimolecular  rate  constant  for  H02#**H20  +  HO2  is  about  10  times 
that  for  H02  +  H02. 


Introduction 

The  hydroperoxy  radical  (HO2)  is  a  molecule  that  is  of  interest 
in  many  fields,  including  environmental  chemistry,  astrochem- 
istry,  and  biochemistry.  It  plays  a  significant  role  in  Earth’s 
atmospheric  chemistry  as  a  component  of  several  important 
gas-phase  reactions.  The  formation  of  the  hydroperoxy  radical 
stems  primarily  from  the  OH  radical-initiated  degradation  of 
organic  compounds,  and  the  subsequent  photolysis  of  alde¬ 
hydes:* 1 

H2CO(g)  +  hv  (A  <370  nm)  —  HCO(g)  +  H(g)  ( 1 ) 

H(g)  +  02-H02(g)  (2) 

HCO(g)  +  02  -  H02(g)  +  CO(g)  (3) 

The  self-reaction  of  two  hydroperoxy  radicals  leads  to  the 
production  of  molecular  oxygen  and  hydrogen  peroxide:2 

H02(g)  +  H02(g)  -  HOOH(g)  +  02(g)  (4) 

Once  formed,  hydrogen  peroxide  rapidly  enters  aqueous  aerosols 
where  it  oxidizes  sulfur  dioxide  to  sulfuric  acid.  The  reaction 
of  hydroperoxy  radical  with  nitric  oxide  forms  NO2,  whose 
photolysis  leads  to  formation  of  ozone  as  shown  in  Scheme  1. 

The  hydroperoxy  radical  also  has  a  very  strong  affinity  for 
water.  The  atmospheric  importance  of  this  fact  came  to  light  3 
decades  ago,  when  it  was  determined  that  the  rate  of  hydrop¬ 
eroxy  radical’s  self-reaction  in  the  gas  phase  was  strongly 
affected  by  the  presence  of  water  vapor.2-7  This  was  widely 
suspected  to  result  from  the  formation  of  an 
complex,  and  led  to  one  of  the  first  ab  initio  studies  of  this 
hydrogen  bonded  complex.8  Astronomers  are  also  interested  in 
understanding  the  interaction  between  hydroperoxy  and  water 
molecules  as  related  to  their  interpretation  of  data  obtained  from 
icy  surfaces  in  space.  Radiolysis  of  icy  surfaces  on  comets, 
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SCHEME  1:  Catalytic  Cycles  in  HO*  and  NO* 
Producing  Ozone  While  Oxidizing  Volatile  Organic 
Compounds 


interstellar  dust,  and  satellites  (e.g.,  Jupiter’s  moon  Europa) 
could  result  in  the  chemical  processes  seen  in  hydroperoxy 
radical’s  self-reaction  or  its  reaction  with  water.9,10  The  hydro¬ 
peroxy  radical  is  also  believed  to  be  important  in  several 
biological  processes,  where  it  has  ample  opportunity  for  forming 
weak  interactions  with  water  molecules.11  Thus,  understanding 
the  structure  and  energetics  of  the  HC^'^O  complex  is 
fundamentally  important  for  several  fields  of  study. 

There  has  been  an  indirect  determination  of  the  equilibrium 
constant  for  formation  of  the  HCV’TUO  complex12  and  an 
attempt  to  obtain  its  spectrum.13  However,  no  interpretable 
spectrum  has  been  obtained  for  this  complex.  Recently,  there 
have  been  several  quantum-mechanical  studies  of  a  single 
conformer  of  the  HCV'^O  complex.14  16  Because  of  the  role 
water  has  in  hydroperoxy  radical  reactions,  gaining  a  detailed 
energetic  and  structural  understanding  of  possible  HCV’THO 
and  H02**‘(H20)2  complexes  will  give  additional  insight  into 
the  role  this  hydrogen  bonded  system  plays  in  atmospheric 
chemistry  and  other  phenomena. 

Using  CCSD//MP2  calculations  with  a  variety  of  basis  sets, 
the  equilibrium  constant  (standard  state  of  moleculc*cm3)  for 
reaction  5 
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H02(g)  +  H20(g)  — *  H02*"H20(g)  (5) 

at  room  temperature  has  been  estimated  to  be  between  4  x  10“ 18 
and  1.3  x  10“21. 14,17  Experimentally,  Kanno  and  colleagues 
detennined  the  reaction’s  equilibrium  constant  Kc  to  be  (5.2  ± 
3.2)  x  10“ 19  at  297  K,  which  leads  to  a  concentration  ratio  of 
[H02— H20]/[H02]  of  0.19  ±  0.1 1  at  297  K  and  50%  relative 
humidity.12 

Hydroperoxy  radical’s  self-reaction  has  been  the  subject 
of  three  computational  studies  to  date,1 5,18,19  two  of  which 
have  examined  the  catalytic  role  of  water.  In  addition  to  the 
H02***H20  dimer  complex,  the  H02***(H20)2  trimer  is  of 
interest.  The  trimer  may  be  atmospherically  relevant,  and  serves 
as  a  stepping  stone  to  modeling  bulk  or  surface  hydration  of 
H02. 16,20  As  a  model  for  H02  interacting  with  cloud  droplets, 
Shi  and  co-workers  performed  quantum-mechanical  calculations 
on  the  H02***(H20)2o  complex.20  They  proposed  that  hydrop¬ 
eroxy  radical  reactions  may  occur  on  the  surface  and  in  the 
interior  of  a  cloud  droplet.  Complexes  and  clusters  of  water 
with  oxidants,  including  the  hydroperoxy  radical,  is  the  subject 
of  two  recent  reviews.21,22  Our  goal  is  to  obtain  all  of  the 
potential  configurations  of  H02***H20  and  H02***(H20)2  com¬ 
plexes,  and  determine  their  energies  and  relative  abundances 
in  the  lowermost  troposphere.  This  is  a  fundamental  step  for 
obtaining  a  better  understanding  of  water’s  role  in  the  self¬ 
reaction  of  the  hydroperoxy  radical  and  its  gas-  and  aqueous- 
phase  chemistry. 

Method 

The  initial  H02***H20  configurations  were  built  using  the 
SPARTAN23  software  by  placing  a  water  molecule  about  the 
hydroperoxy  radical  at  three  different  locations.  In  two  of  these 
configurations,  each  radical  oxygen  atom  accepted  a  hydrogen 
bond  from  the  water,  while  the  third  configuration  had  the  water 
accepting  a  hydrogen  bond  from  the  hydroperoxy  radical. 
Additional  configurations  were  generated  using  a  12-fold 
rotation  around  the  hydrogen  bond  of  each  initial  structure,  and 
each  of  these  configurations  was  optimized  with  PM3.  The 
resulting  conformers  were  grouped  based  on  dihedrals  of 
matching  signs  (positive  or  negative)  and  similar  values  (i.e., 
0-8,  8-25,  25-90,  90-120,  120+).  Previous  work  on  small 
water  clusters  has  shown  that  PM3  geometry  optimizations  of 
conformers  with  similar  dihedral  angles  converge  on  a  common 
minimum.24  The  groups  were  not  based  on  energies  because 
most  conformations  from  each  search  possessed  PM3  heat  of 
formation  energies  within  3  kcahmol-1  of  each  other.  We 
performed  a  Hartree— Fock  (HF)  6-3 1G*  optimization  using  the 
Gaussian03  program25  on  the  lowest  energy  member  from  each 
of  these  groups.  The  resulting  HF  energy  values  and  structures 
were  then  compared  to  identify  unique  complexes.  Some  of  the 
structures  picked  from  different  groups  optimized  to  similar 
structures,  and  in  total,  only  two  unique  structures  (dimers  A 
and  C)  were  located.  A  third  dimer  configuration  (dimer  B) 
was  found  by  enforcing  Cs  symmetry  on  the  molecule.  Without 
Cs  symmetry,  this  configuration  quickly  minimizes  to  dimer  C 
in  Figure  1.  Finally,  Gibbs  free  energies  were  obtained  using 
the  Gaussian  (G3)  model  chemistry.26  No  further  corrections 
have  been  made  for  basis  set  superposition  error,  as  the  method 
has  an  inherent  correction  for  basis  set  artifacts.27-29 

We  started  our  search  for  the  H02***(H20)2  complexes  by 
bonding  another  water  molecule  to  the  original  H02***H20 
complexes  in  various  locations.  This  resulted  in  13  new  starting 
structures.  For  the  exploration  of  the  conformations  of  these 
structures,  over  one  hundred  additional  structures  were  generated 


Dimer-A 


Figure  1.  Molecular  structures  of  HO:,  H:0,  and  the  three  HO:*** 
H2O  dimers  determined  at  the  MP2/6-31G*  level  of  theory.  Key 
interatomic  distances  (angstroms)  and  angles  (degrees)  are  given. 


using  either  a  12-fold  or  an  8-fold  rotation  around  the  hydrogen 
bond,  prior  to  PM3  optimization  of  each  conformer.  The 
conformers  were  grouped  based  on  dihedral  angles,  optimized 
with  HF/6-31G*  calculations,  grouped  based  on  energies,  and 
the  remaining  conformers  were  input  to  the  G3  model  chemistry. 
This  approach  generated  four  H02***(H20)2  complexes.  We 
performed  frequency  calculations  on  all  complexes  using  the 
HF/6-31G*  level  of  theory,  and  computed  a  Boltzmann  distribu¬ 
tion  using  our  G3  model  chemistry  free  energies.  All  complexes 
reported  have  been  verified  to  be  stable  minima  on  the  HF/6- 
31G*  potential  energy  surface.  All  calculations  involving 
hydroperoxy  radical  were  done  using  spin-unrestricted  wave 
functions. 


Results  and  Discussion 

Structures.  We  found  three  unique  hydrogen  bonded  com¬ 
plexes  of  the  H02*‘*H20  heterodimer,  including  the  one 
previously  described  in  the  literature  where  the  hydroperoxy 
radical  molecule  is  the  hydrogen  bond  donor  (dimer  A  in  Figure 
1).  The  two  new  dimer  configurations  (dimers  B  and  C)  both 
involve  the  hydroperoxy  radical  acting  as  a  hydrogen  bond 
acceptor. 

Dimer  A  belongs  to  the  C\  point  group  and  has  an  enantiomer. 
Our  MP2(full)/6-31G*  optimized  value  for  the  hydrogen  bond 
length  is  1.785  A,  only  0.004  A  shorter  than  the  MP2/6- 
31 1++G(2df,2pd)  value  of  Aloisio  and  Francisco.14  The 
excellent  agreement  between  MP2(full)/6-31G*  and  MP2/6- 
3 1 1++G(2df,2pd)  results  allows  us  to  have  confidence  in  the 
geometries  of  dimers  B  and  C.  Both  dimers  B  and  C  possess 
Cs  symmetry.  Dimer  B  has  a  hydrogen  bond  length  of  2. 132  A, 
indicating  weaker  attractive  forces  involved  in  this  dimer 
formation  in  comparison  to  dimer  A.  Breaking  the  CT  symmetry 
in  dimer  B  followed  by  an  optimization  forms  dimer  C,  the 
third  and  least  stable  dimer  (rotating  dimer  B's  water  by  180° 
results  in  the  formation  of  dimer  C,  recapturing  Cs  symmetry), 
dimer  C  has  a  2.154  A  hydrogen  bond,  which  is  the  longest 
hydrogen  bond  distance  seen  in  any  of  the  dimer  or  trimer 
complexes. 
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Figure  2.  Molecular  structures  of  the  four  HCb**  *(1+0)2  trimers 
determined  at  the  MP2/6-31G*  level  of  theory.  Key  interatomic 
distances  (angstroms)  and  angles  (degrees)  are  given. 


We  found  four  unique  configurations  for  the 
trimer  complex,  all  of  which  have  the  hydroperoxy  radical  acting 
as  a  hydrogen  bond  donor  and  acceptor,  as  seen  in  Figure  2. 
Each  trimer  possesses  C\  symmetry  and  has  an  enantiomer,  with 
trimers  A,  B,  and  D  existing  as  cyclic  clusters.  The  only 
difference  between  trimers  A  and  B  is  the  orientation  of  the 
hydrogen  on  the  water  molecules.  In  these  trimers,  the  cyclic 
structure  allows  both  oxygens  of  the  hydroperoxy  radical  to 
participate  in  hydrogen  bonding  with  a  water,  one  as  a  hydrogen 
bond  donor  and  one  as  an  acceptor.  By  contrast,  in  trimer  C 
this  is  achieved  with  an  extended  structure;  thus  trimer  C  is  the 
only  cluster  where  the  two  water  molecules  do  not  form  a 
hydrogen  bond  between  each  other.  Trimer  D  is  unique  as  the 
hydroperoxy  radical’s  hydroxyl  oxygen  acts  both  as  a  hydrogen- 
bond  donor  and  acceptor,  while  its  other  oxygen  (the  radical 
center)  does  not  participate  in  any  noncovalent  interactions.  An 
interesting  observation  about  the  trimer  complexes  is  that  the 
hydrogen  bonds  formed  by  hydroperoxy  radical  donating 
hydrogen  are  shorter  than  that  observed  in  dimer  A,  which  has 
a  similar  bonding  motif.  This  is  an  indication  that  the  inclusion 
of  the  second  water  alters  the  electronic  configuration  about 
dimer  A,  allowing  for  a  more  energetically  favorable  interaction 
to  occur.  The  cyclic  trimers  have  the  same  cooperativity 
interactions  as  cyclic  water  clusters  that  have  hydrogen  bonds 
that  all  donate  in  the  same  direction.24-27,30-40  The  most  stable 
H02***(H20)2  clusters  can  be  constructed  simply  by  replacing 
one  of  the  water  molecules  in  the  most  stable  water  trimers24 
with  the  radical. 

Thermochemistry.  As  can  be  seen  in  Table  1,  dimer  A  is 
significantly  more  stable  than  the  two  other  conformers.  For 
dimer  A  Aloisio  and  Francisco  reported  a  CCSD(T)/6-3 1 1 ++G- 


(2df,2p)//B3LY P/6-31 1  ++G(2df,2p)  electronic  energy  for  reac¬ 
tion  5,  A£eiec(0  K),  of -9.4  kcabmol-1  and  after  correcting  for 
zero-point  vibrational  energy,  A£zpve,  the  energy  becomes  -6.9 
kcabmol-1.14  These  values  are  in  excellent  agreement  with  our 
G3  A£eiec(0  K)  and  AZTzpve  values  of —9. 1  and  -7. 1  kcabmol-1. 
Including  thermal  and  entropic  effects  results  in  dimer  A  having 
a  enthalpy,  AH°(298  K),  and  free  energy  for  reaction  5, 
AG°(298  K),  of  -7.4  and  -0.5  kcabmol-1.  Dimers  B  and  C 
have  a  significantly  more  positive  free  energy  for  reaction  5, 
with  values  of  2.4  and  3.1  kcabmor1.  The  fact  that  dimer  A 
consists  of  two  enantiomers  further  lowers  the  effective  free 
energy  by  RT  ln(2),  for  an  effective  AG°(298  K)  of  —0.94 
kcabmol-1.  A  Boltzmann  calculation,  taking  into  account  the 
enantiomer  of  dimer  A,  predicts  the  relative  abundances  at 
298.15  K  of  dimers  A,  B,  and  C  to  be  99.5%,  0.4%,  and  0.1%. 
Dimers  of  hydroperoxy  radical  and  a  water  molecule  will  be 
dominated  by  dimer  A,  where  the  hydroperoxy  radical  acts  as 
the  hydrogen  bond  donor. 

The  calculated  Gibbs  free  energy  for  the  reaction  forming 
the  two  enantiomers  of  dimer  A  is  -0.9  kcabmol-1,  which  is 
in  good  agreement  with  the  experimental  value  of  —  1 .5  (+0.6/— 
0.3)  kcabmol-1  from  Kanno  and  co-workers  as  determined  from 
their  reported  equilibrium  constant.12  The  relative  instability  of 
dimers  B  and  C  means  that  they  contribute  negligibly  to  the 
overall  dimer  population. 

Let  us  now  consider  the  thermodynamics  of  the  formation 
of  the  trimers: 

H02(g)  +  2H20(g)  -  H02-(H20)2(g)  (6) 

Table  1  contains  the  energies  and  the  Boltzmann  distribution 
calculation  at  298  K  for  the  trimers.  As  expected  based  on  the 
strong  similarities  of  their  structures,  trimers  A  and  B  possess 
nearly  the  same  energy,  with  AG°  (298  K)  for  reaction  6  of 
—0.47  and  -0.39  kcabmol-1,  respectively.  Trimers  A  and  B 
are  also  the  most  stable  trimers,  and  are  responsible  for  99%  of 
the  trimer  population.  Hydrogen  bonding  in  these  two  trimers 
occurs  in  a  three-membered  ring,  with  each  moleculc/radical 
acting  as  both  a  hydrogen  bond  donor  and  acceptor.  This  cyclic 
motif  has  been  observed  in  water  clusters,27-35  and  has  been 
attributed  to  the  enhanced  cooperativity  of  these  cyclic  struc¬ 
tures.37 

We  now  consider  the  abundance  of  these  dimers  and  trimers 
in  the  atmosphere  at  298  K.  This  depends  on  the  K?  values  for 
reactions  4  and  5,  the  abundance  of  water  vapor,  and  the 
abundance  of  hydroperoxy  radical: 


^p.dimer  ^complcx^l  IO-^I^C) 

(7a) 

^p.trimcr  ^complcx^l  10,^1 120^ 

(7b) 

where  Pi  are  expressed  in  atmospheres.  Since  there  are  multiple 
dimers  and  trimers,  we  compute  the  ratio  /\ompiex,//^H02  f°r  each 
complex  /',  and  determine  the  fraction  of  the  total  HO2  present 
as  each  complex: 

[^complcx./^lioj^  1  ^/[^c omnlcx./^HO,]}  W 

Values  of  Kp  calculated  from  AG°(298  K)  and  accounting  for 
the  presence  of  enantiomers  are  listed  in  Table  2.  HCL 
concentrations  vary  enormously  in  time  and  space,  but  often 
reach  5  x  108  molecules*cirT3  in  photochemically  active  regions 
of  the  lower  troposphere.  Present  detection  methods41  probably 
detect  HO2—H2O  and  H02***(H20)2  with  roughly  the  same 
efficiency  as  monomer,  so  we  will  take  5  x  108  molecules*cm-3 
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TABLE  1:  Changes  in  G3  Electronic  Energies,  Enthalpies,  Entropies,  Gibbs  Free  Energies,  and  Boltzmann  Distribution  for 
Formation  of  H02**‘(H20)„  for  n  —  1  and  2* 


H02-(H20)„ 

A£elec(0  K) 

A£Zpve(0  K) 

A//°(298  K) 

AG°(298  K) 

AS°  (cal’mol-'*  K"') 

distribution  (298  K) 

n  =  1 
dimer  A 

-9.14 

-7.06 

-7.42 

-0.53 

76.45 

99.5% 

dimer  B 

-3.54 

-2.38 

-2.15 

2.37 

84.42 

0.4% 

dimer  C 

-3.43 

-2.22 

-2.04 

3.12 

82.25 

0.1% 

n  —  Z 
trimer  A 

-20.64 

-15.80 

-16.98 

-0.47 

89.20 

52.5% 

trimer  B 

-20.08 

-15.46 

-16.52 

-0.39 

90.45 

45.9% 

trimer  C 

-13.53 

-10.36 

-10.41 

1.68 

104.04 

1.4% 

trimer  D 

-15.71 

-11.44 

-12.27 

2.91 

93.66 

0.2% 

0  All  energies  reported  are  based  upon  a  standard  state  of  1  atm  and  have  units  of  kcabmol  1 


TABLE  2:  Equilibrium  Constant  (Standard  State  of  1  atm) 
and  Population  of  Free  H02  and  H02”‘(H20)„  for  n  =  1 
and  2  at  100%  Relative  Humidity  for  Assumed  Total  |H02| 
=  5  x  108  molecules’cm-3  at  a  Temperature  of  298.15  Kfl 


species 

Kp  (1  atm) 

N  (molecules*cm  3) 

1I02  monomer 

4.3  x  10s 

n  =  1 

dimer  A  (e) 

2.4  (13  ±  8)A 

6.6  x  107 

dimer  B 

0.018 

2.5  x  105 

dimer  C 

0.052 

6.9  x  104 

n  =  2 

trimer  A  (e) 

4.4 

1.9  x  106 

trimer  B  (e) 

3.9 

1.6  x  106 

trimer  C  (e) 

0.12 

4.9  x  I04 

trimer  D  (e) 

0.015 

6.2  x  103 

a  Equilibrium  constants  are  increased  by  In  2  for  species  with  two 
enantiomers,  where  parenthesis  (e)  indicates  an  enantiomeric  pair. 
h  Reference  12. 


to  be  the  concentration  of  H02  (as  monomer  and  in  all  clusters), 
and  assume  that  only  dimers  and  trimers  contribute  significantly 
to  the  H02  cluster  population.  For  this  calculation  we  will 
assume  a  relative  humidity  (RH)  of  100%  (P^o  =  0.03125  atm). 

Results  of  these  calculations  are  listed  in  Table  2.  We  predict 
that  dimer  A  will  have  a  concentration  of  6.6  x  107 
molecules'cm-3,  and  dimers  B  and  C  will  have  equilibrium 
concentrations  2-3  orders  of  magnitude  lower.  Trimers  A  and 
B  will  have  concentrations  of  1.9  x  106  and  1.6  x  106 
molecules*cm  ~3,  while  trimers  C  and  D  will  have  concentrations 
1.5— 2.5  orders  of  magnitude  lower.  Since  the  hydroperoxy 
radical  concentration  is  much  less  than  that  of  water  vapor,  the 
computed  dimer  and  trimer  concentrations  scale  linearly  with 
the  assumed  total  H02  concentration.  Conversely,  dimer  and 
trimer  concentrations  do  not  scale  linearly  with  RH;  decrease 
of  RH  to  50%  reduces  dimer  concentrations  by  44%  and  trimer 
concentrations  by  73%. 

As  noted  in  the  Introduction,  one  motivation  for  this  study 
was  an  interest  in  how  water  vapor  enhances  the  self-reaction 
of  H02  (reaction  4).  The  observed  effect  is  roughly  linear  with 
water  concentration.  One  commonly  used  recommendation7  is 
that  the  bimolecular  rate  constant  of  reaction  4  is  increased  by 
a  multiplicative  factor: 

k( T,  [H20])  = 

*(T,  [H20]  =  0)  {1  +  1.4  x  10~2l[H2O]e+2200/r}  (9) 

where  [H20]  is  in  molecules*cm-3.  It  is  widely  assumed  that 
the  increase  in  rate  constant  is  due  to  the  reaction: 

H02-H20(g)  +  H07(g)  -  HOOH(g)  +  H20(g)  +  02(g) 

(10) 

If  we  make  the  simple  assumption  that  substituting  dimer  A 


for  H02  monomer  increases  the  reaction  rate  by  an  enhancement 
factor,  E,  due  to  the  fraction,  f  of  dimer  A  present,  then  at 
298.15  K  eq  9  is  related  to  E  and  /by 

{1  +  1.4  x  10_2l[H2O]e+2200/29815}  =  (1  —J)  +  fE  (11) 

The  formation  of  dimer  A  reduces  the  concentration  of  H02 
monomer  by  a  fraction  /,  thereby  causing  a  corresponding 
decrease  in  the  rate  of  the  self-reaction  of  H02  monomer.  The 
(1  —  f)  term  on  the  right-hand  side  corresponds  to  this  fractional 
decrease.  We  then  solve  for  E  using  the  range  of  RH  used  in 
experiment  (0-55%).  Using  the  values  of  Kp  computed  here  to 
determine  f  we  obtain  E  —  10.5.  Using  Kanno’s  value  of 
Kp  =  13  ±  8,  we  compute  an  enhancement  factor  of  2-10 
(4  [+6,  -2]). 

Vibrational  Spectra.  The  calculated  and  experimental  fre¬ 
quencies  for  the  water  and  hydroperoxy  monomers,  and  the  three 
dimer  complexes  are  given  in  Table  3.  The  scaled  HF/6-31G* 
frequencies  for  the  water  monomer  agree  very  well  with  the 
experimental  frequencies,  with  a  maximum  error  of  36  cm-1. 
The  frequency  of  the  V\  mode  of  the  hydroperoxy  monomer 
should  be  diagnostic  of  hydrogen  bond  donation  by  H02,  so  it 
would  be  helpful  to  be  able  to  rely  on  the  computed  frequencies 
of  this  mode  in  the  complexes.  Unfortunately,  the  scaled 
frequency  we  obtain  at  the  HF/6-31G*  level  overestimates  the 
experimental  frequency  by  172  cm-1. 

The  experimental  OH  stretching  frequency  of  the  H02 
molecule  in  the  dimer  complex  occurs  as  a  strong  infrared  peak 
at  3236  cm  '1,  redshifted  by  177  cm-1  with  respect  to  hydro¬ 
peroxy  monomer  (both  in  an  Ar  matrix).42  Computations  indicate 
a  redshift  of  101  cm-1  for  dimer  A,  but  only  3  and  1  cm-1  for 
dimers  B  and  C.  Clearly,  the  redshift  seen  in  the  experimental 
spectrum  of  the  dimer  is  only  consistent  with  dimer  A.  Note 
that  the  Ar  matrix  only  redshifts  i>i  of  hydroperoxy  monomer 
by  ~20  cm  *1  with  respect  to  the  gas-phase  value.43-44 

The  scaled  vibrational  frequencies  for  the  four  trimers  are 
reported  in  Table  4  along  with  their  relative  infrared  and  Raman 
intensities.  The  “a”  and  “d”  by  the  mode  labels  in  Table  4 
indicate  whether  the  species  involved  is  acting  as  a  hydrogen 
bond  acceptor  or  donor  in  that  mode.  The  intermolecular 
frequencies  all  exist  below  1000  cm-1,  and  are  complex  motions 
involving  two  or  more  of  the  constituent  molecules.  Unlike  the 
case  of  the  dimers,  it  is  only  within  these  modes  that  the  trimers 
exhibit  large  differences  in  frequencies  between  the  different 
conformers,  most  notably  in  the  H  wagging  modes. 

Detecting  these  dimer  and  trimer  species  is  a  difficult 
experimental  task  because  of  their  low  abundance  and  the 
potential  for  their  rovibrational  spectra  to  overlap  each  other’s 
spectra,  as  well  as  the  spectra  of  the  monomers  and  pure  water 
clusters.  Fortunately,  there  are  a  few  modes  that  can  be  used  to 
distinguish  dimer  A  from  trimers  A  and  B.  Table  5  summarizes 
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TABLE  3:  Sealed  Vibrational  Frequencies  (cm  l)  of  H02,  H20,  and  Complex  at  HF/6-31G*  (Scaled  by  0.8929,  with 


Experimental  Values  in  Parentheses  and  IR  and  Raman  Intensities  in  Brackets)" 

mode* 

H20 

ho2 

dimer  A 

dimer  B 

dimer  C 

U3(H20) 

3740  (3756 )c  (3734)'' 

3730  [m,m]  (3691/ 

3734  [m,m] 

3642  [w,s] 

u,(H.O) 

3634  (3657)r  (3638)rf 

3629  [w,s]  (3501.5/ 

3633  [w,m] 

3738  [w,m] 

Ui(HO,) 

3585  (3413.0 / 

3483  [vs,s]  (3236.2/ 

3582  [w,s] 

3585  [w,s] 

u2(H20) 

1631  (1595)' 

1627  [s,w] 

1643  [m,w] 

1646  [m,w] 

u2(H02) 

1449  (1388.9)*' 

1 538  [m,w](  1479.3/ 

1454  [w,w] 

1447  [w,w] 

u3(H02) 

1117(1100.8)' 

1126  [w,w]  (1 120.4/ 

1125  [w,w] 

1117  [w,w] 

H  wag 

562  [s,0] 

334  [vs,w] 

312  [vs,w] 

H"*0  stretch 

332  [s,w] 

91.37  [w,0] 

136  [0,0] 

rock  (H20) 

229  [s,w] 

180  [m,w] 

136  [m,w] 

deformation 

207  [w,0] 

120  [0,0] 

107  [w,w] 

deformation 

94  [m,w] 

37  [0,0] 

98  [0,0] 

rock  (H02  and  H20) 

59  [w,0] 

34  [m,0] 

38  [w,0] 

"  Intensities  are  listed  as  very  strong  (vs),  strong  (s),  medium  (m),  weak  (w),  or  negligible  (0).  h  Mode  motion:  W|(H20)  for  OH  symmetric 
stretching  in  H20;  v2(H20)  for  HOH  bending  in  H20;  u3(H20)  for  OH  asymmetric  stretching  in  H20;  U|(H02)  for  H-0  stretching  in  H02;  u2(H02) 
for  HOO  bending  in  H02;  u3(H02)  for  0-0  stretching  in  H02.  c  Reference  46.  d  Reference  47.  '  Reference  48.  f  Reference  49. 


TABLE  4:  Scaled  Vibrational  Frequencies  (cm  *)  of  H02***(H20)2  Complexes  at  HF/6-31G*  (Scaled  by  0.8929,  with  IR  and 
Raman  Intensities  in  Brackets") 


mode" 

trimer  A 

trimer  B 

trimer  C 

trimer  D 

u3(H20  d) 

3717  [m,m] 

3721  [m,m] 

3733  [w,s] 

3722  [m,m] 

t>i(H20  a) 

3532  [s,s] 

3544  [s,s] 

3732  [m,s] 

3561  [s,s] 

Ui(H20  d) 

3610  [m,s] 

3613  [m,s] 

3642  [w,s] 

3617  [w,s] 

u3(H20  a) 

3702  [m,m] 

3706  [m,m] 

3632  [w,s] 

3709  [m,m] 

Vi(H02) 

3373  [vs, vs] 

3379  [vs, vs] 

3455  [vs, vs] 

3430  [vs,s] 

u2(H20  d) 

1637  [m,w] 

1635  [m,w] 

1652  [s,m] 

1636  [w,w] 

u2(H20  a) 

1 649  [m,w] 

1656  [m,w] 

1626  [m,w] 

1649  [m,w] 

u2(H02) 

1568  [m,w] 

1570  [m,w] 

1536  [w,vw] 

1512  [m,w] 

u3(H02) 

1139  [w,w] 

1139  [w,w] 

1 128  [w,vw] 

1127  [w,w] 

H  wag 

721  [m,w] 

646  [m,w] 

621  [m,0] 

711  [w,0] 

H  wag 

665  [m,0] 

659  [m,w] 

334  [m,vw] 

570  fvs,w] 

H  wag  (H20  a,  d) 

455  [m,w] 

403  [m,0] 

297  [w,vw] 

390  [s,w] 

O**0  stretch  (H20  a,  H02) 

263  [m,w] 

240  [w,0] 

234  [m,vw] 

212  [w,0] 

H  wag 

355  [m,w] 

262  [m,w] 

215  [m,0] 

346  [m,0] 

H  twist  (H20  d) 

246  [w,w] 

384  [m,w] 

140  [w,vw] 

242  [w,0] 

rock  (H20  a  &  d) 

229  [w,0] 

197  [w,w] 

130  [w,0] 

193  [m,w] 

0***0  stretch  (H20  a  &  d) 

199  [w,0] 

191  [w,w] 

112  [w,0] 

172  [w,0] 

hydrogen  motion 

1 64  [m,w] 

133  [m,0] 

65  [w,0] 

103  [m,w] 

hydrogen  motion 

65  [w,0] 

54  [w,w] 

47  [w,vw] 

124  [w,0] 

heavy  atom  motion 

151  [w,w] 

156  [w,w] 

25  [w,0] 

36  [w,0] 

heavy  atom  motion 

104  [w,0] 

100  [w,0] 

22  [w,0] 

87  [w,w] 

"  See  Table  3  for  description  of  intensities  and  mode  motion. 

TABLE  5:  HF/6-31G*  Scaled  Infrared  and  Raman  Absorptions  for  the  Low  Energy  Clusters  H02***(H20/ 

,  for  n  =  1  and  2, 

along  with  the  Respective  Monomers" 

modes  H02  H20 

h2o—h2oa  H20— (H20)/ 

dimer  A 

trimer  A 

trimer  B 

u,  (H20)  3634  [m,  na] 

3630  [m,w]  3562  [s,m];  3558  [s,m]; 

3629  [w,s]  (— 5//na) 

3532  [s,s]  (—102//— 97) 

3544  [s,s]  (— 90//-F12) 

3519  [w,s] 

u,  (H02)  3585  [m,na] 

3483  [vs,s]  (— 102//na) 

3373  [vs, vs]  (—212//— 1 10) 

3379  [vs, vs]  (— 206//+6) 

H  motion 

721  [m,w] 

646  [m,w]  (na//— 75) 

665  [m,0] 

659  [m,w]  (na//— 19) 

553  [m,w]  598  [s,w] 

562  [s,0] 

455  [m,w]  (na//—  1 07) 

403  [m,0]  (na//— 52) 

342  [m.wj  407  [m,w];  3 14  [w,w]; 

355  [m,w] 

262  [m.w]  (na//— 93) 

296  [w,w];  221  [w,w] 

"  In  parentheses  is  given  the  spectral  shift  from  the  monomer//shift  from  the  preceding  cluster,  where  a  positive  value  indicates  a  blue  shift  and 
a  negative  value  indicates  a  red  shift.  h  Reference  45. 


this  information,  showing  that  dimer  A’s  hydroperoxy  radical’s 
O-H  stretch  will  be  red  shifted  by  approximately  100  cm"1 
from  that  of  free  hydroperoxy  and  will  have  a  very  intense 
infrared  peak.  Trimers  A  and  B  will  have  this  mode,  further 
red  shifted  by  another  100  cm-1,  again  with  very  strong  infrared 
absorption  bands.  Similarly,  water’s  symmetric  O— H  stretch 
in  dimer  A  will  be  unchanged  from  that  of  free  water,  but  if 
trimers  A  or  B  form,  a  strong  infrared  peak  will  appear  red 
shifted  by  approximately  100  cm  '1.  In  terms  of  the  symmetric 
O—H  stretching  mode  of  the  water  molecule  (t>i(H20))  and 
hydrogen  wagging  mode  of  the  complex,  the  unique  infrared 


peaks  for  dimer  A  and  trimers  A  and  B  will  be  masked  by 
vibrational  modes  that  arise  from  the  formation  of  the 
H2O  and  H2O"(H20)2  complexes,45  as  seen  in  Table  5. 

Conclusion 

Our  thorough  search  of  conformational  space  discovered  two 
unique  HCL’^fbO  complexes  in  which  hydroperoxy  acts  as  a 
hydrogen  bond  acceptor,  in  addition  to  the  previously  described 
HO2—H2O  complex  in  which  hydroperoxy  acts  as  a  hydrogen 
bond  donor.  The  donor  complex  is  the  most  abundant  of  the 
three,  resulting  from  a  more  favorable  free  energy  of  formation 


H02— (H20)„  (n  =  1—2)  Hydrogen  Bonded  Complexes 

for  the  two  enantiomers  of  this  complex.  Our  computed  binding 
energies  and  free  energies  were  very  similar  to  those  obtained 
from  previous  computations.  Our  free  energy  of  formation  is 
well  within  the  error  bars  of  the  experimental  value.  We 
identified  two  cyclic  trimers  that  are  very  similar  in  energy. 
The  most  favorable  hydrogen-bonding  motif  has  both  hydrop- 
eroxy  radical’s  oxygens  participating  in  hydrogen  bonding,  one 
as  a  hydrogen  bond  donor  and  the  other  as  an  acceptor,  with 
the  waters  forming  a  hydrogen  bond  between  them.  This  same 
motif  has  been  observed  in  cyclic  water  clusters,27*35  and  has 
been  attributed  to  the  enhanced  cooperativity  of  these  cyclic 
structures.37  We  predict  that  when  the  H02  concentration  is  on 
the  order  of  108  molecules*cm“3  in  the  lower  troposphere  at 
298  K,  that  the  number  of  H02"*H20  complexes  is  on  the  order 
of  107  molecules*cm~3  and  the  number  of  H02***(H20)2 
complexes  is  on  the  order  of  106  molecules*cm-3. 

Acknowledgment.  Acknowledgment  is  made  to  the  donors 
of  the  Petroleum  Research  Fund,  administered  by  the  American 
Chemical  Society,  to  the  Research  Corporation,  to  NIH,  to  DOD, 
and  to  Hamilton  College  for  support  of  this  work.  This  project 
was  supported  in  part  by  NSF  Grant  CHE-0457275,  and  by  NSF 
Grants  CHE-01 16435  and  CHE-0521063  as  part  of  the  MER¬ 
CURY  high-performance  computer  consortium  (http://mercu- 
ry.chem.hamilton.edu).  T.S.D.  would  like  to  acknowledge 
support  from  the  National  Science  Foundation  under  Grant 
ATM-0352926. 


Supporting  Information  Available:  Tables  containing 
energy  values  and  coordinates  from  the  final  orientation  for 
dimer  A,  dimer  B,  dimer  C,  trimer  A,  trimer  B,  trimer  C,  trimer 
D,  water,  and  H02  radical.  This  material  is  available  free  of 
charge  via  the  Internet  at  http://pubs.acs.org. 


References  and  Notes 

(1)  Wayne,  R.  P.  Chemistry  of  Atmospheres,  2nd  cd.;  Oxford  University 
Press:  Oxford,  U.K.,  1991. 

(2)  Sander,  S.  P.;  Peterson,  M.;  Watson.  R.  T.;  Patrick,  R.  J.  Phvs. 
Chem.  1982,  86,  1236. 

(3)  Hamilton,  J.  E.,  Jr.;  Lii,  R.-R.  Inf.  J.  Chem.  Kinet.  1977,  9,  875. 

(4)  DcMore,  W.  B.  J.  Phys.  Chem.  1979,  83,  1 1 13. 

(5)  Cox,  R.  A.;  Burrows,  J.  P.  J.  Phys.  Chem.  1979,  83,  2560. 

(6)  Lii,  R.-R.;  Sauer,  J.;  Myran,  C.;  Gordon,  S.  J.  Phys.  Chem.  1981, 
85,  2833. 

(7)  Kircher,  C.  C.;  Sander,  S.  P.  J.  Phvs.  Chem.  1984,  88,  2082. 

(8)  Hamilton,  J.  E.,  Jr.;  Nalcway,  C.  A.  J.  Phvs.  Chem.  1976,  80,  2037. 

(9)  Pan,  X.;  Bass,  A.  D.;  Jay-Gerin,  J.-P.;  Sanchc,  L.  Icarus  2004,  172, 

521. 

(10)  Baragiola,  R.  A.  Nucl.  Instrum.  Methods  Phys.  Res.  Sect.  B-Beam 
Interact.  Mater.  At.  2005,  232,  98. 

(11)  De  Grey,  A.  DNA  Cell  Biol.  2002,  21,  251. 

(12)  Kanno,  N.;  Tonokura,  K.;  Tczaki,  A.;  Koshi,  M.  J.  Phys.  Chem.  A 
2005,  109 ,  3153. 

(13)  Aloisio,  S.;  Francisco,  J.  S.;  Fricdl,  R.  R.  J.  Phvs.  Chem.  A  2000, 
104,  6597. 

(14)  Aloisio,  S.;  Francisco,  J.  S.  J.  Phys.  Chem.  A  1998,  102,  1899. 

(15)  Zhu,  R.  S.;  Lin,  M.  C.  Chem.  Phys.  Lett.  2002,  354,  217. 

(16)  Bclair,  S.  D.;  Kais,  S.;  Francisco,  J.  S.  Mol.  Phys.  2002.  100,  247. 


J.  Phys.  Chem.  A,  Vol.  110,  No.  10,  2006  3691 

(17)  Lendvay,  G.  Z.  Phys.  Chem.-Int.  J.  Res.  Phvs.  Chem.  Chem.  Phvs. 

2001,  215,  377. 

(18)  Zhu,  R.  S.;  Lin,  M.  C.  Physchemcomm  2001. 

(19)  Zhu,  R.  S.;  Lin,  M.  C.  Physchemcomm  2003,  6,  51. 

(20)  Shi,  Q.  C.;  Bclair,  S.  D.;  Francisco.  J.  S.;  Kais,  S.  Proc.  Natl.  Acad. 
Sci.  U.S.A.  2003,  100,  9686. 

(21)  Scnnikov,  P.  G.;  Ignatov,  S.  K.;  Schrcms,  O.  Chemphyschem  2005, 
6,  392. 

(22)  Hansen,  J.  C.;  Francisco,  J.  S.  Chemphyschem  2002,  3,  833. 

(23)  SPARTAN.  Spartan;  5.1.3  ed.;  Wavcfunction,  Inc.:  Irvine,  CA 
92612,  1998. 

(24)  Day,  M.  B.;  Kirschner,  K.  N.;  Shields,  G.  C.  J.  Phys.  Chem.  A 
2005,  109,  6773. 

(25)  Frisch,  M.  J.;  Trucks,  G.  W.;  Schlcgcl,  H.  B.;  Scuscria,  G.  E.;  Robb, 

M.  A.;  Chccscman,  J.  R.;  Montgomery,  J.  A.,  Jr.;  Vrcvcn,  T.;  Kudin,  K. 

N. ;  Burant,  J.  C.;  Millam,  J.  M.;  Iyengar,  S.  S.;  Tomasi,  J.;  Barone,  V.; 
Mcnnucci,  B.;  Cossi,  M.;  Scalmani,  G.;  Rcga,  N.;  Pctersson,  G.  A.; 
Nakatsuji,  H.;  Hada,  M.;  Ehara,  M.;  Toyota,  K.;  Fukuda,  R.;  Hascgawa,  J.; 
Ishida,  M.;  Nakajima,  T.;  Honda,  Y.;  Kitao,  O.;  Nakai,  H.;  Klcnc,  M.;  Li, 
X.;  Knox,  J.  E.;  Hratchian,  H.  P.;  Cross,  J.  B.;  Bakkcn,  V.;  Adamo,  C.; 
Jaramillo,  J.;  Gompcrts,  R.;  Stratmann,  R.  E.;  Yazyev,  O.;  Austin,  A.  J.; 
Cammi,  R.;  Pomclli,  C.;  Ochtcrski,  J.  W.;  Ayala,  P.  Y.;  Morokuma,  K.; 
Voth,  G.  A.;  Salvador,  P.;  Danncnbcrg,  J.  J.;  Zakrzcwski,  V.  G.;  Dapprich, 
S.;  Daniels,  A.  D.;  Strain,  M.  C.;  Farkas,  O.;  Malick,  D.  K.;  Rabuck,  A. 
D.;  Raghavachari,  K.;  Forcsman,  J.  B.;  Ortiz,  J.  V.;  Cui,  Q.;  Baboul,  A. 
G.;  Clifford,  S.;  Cioslowski,  J.;  Stcfanov,  B.  B.;  Liu,  G.;  Liashcnko,  A.; 
Piskorz,  P.;  Komaromi,  I.;  Martin,  R.  L.;  Fox,  D.  J.;  Keith,  T.;  Al-Laham, 
M.  A.;  Peng,  C.  Y.;  Nanayakkara,  A.;  Challacombc,  M.;  Gill,  P.  M.  W.; 
Johnson,  B.;  Chen,  W.;  Wong,  M.  W.;  Gonzalez,  C.;  Pople,  J.  A.  Gaussian 
03,  revision  C.02.  Gaussian,  Inc.:  Wallingford,  CT,  2004. 

(26)  Curtiss,  L.  A.;  Raghavachari,  K.;  Rcdfem,  P.  C.;  Rassolov,  V.; 
Pople,  J.  A.  J.  Chem.  Phys.  1998,  109,  7764. 

(27)  Dunn,  M.  E.;  Pokon,  E.  K.;  Shields,  G.  C.  J.  Am.  Chem.  Soc.  2004, 
126,  2647. 

(28)  Pickard,  F.  C.;  IV;  Dunn,  M.  E.;  Shields,  G.  C.  J.  Phys.  Chem.  A 
2005,  109,  4905. 

(29)  Pickard,  F.  C.;  IV;  Pokon,  E.  K.;  Liptak,  M.  D.;  Shields,  G.  C.  J. 
Chem.  Phys.  2005,  024302. 

(30)  Tsai,  C.  J.;  Jordan,  K.  D.  J.  Phys.  Chem.  1993,  97,  5208. 

(31)  Xanthcas,  S.  S.  J.  Chem.  Phys.  1994,  100,  7523. 

(32)  Xantheas,  S.  S.  J.  Chem.  Phys.  1995,  102,  4505. 

(33)  Grucnloh,  C.  J.;  Carney,  J.  R.;  Arrington,  C.  A.;  Zwicr,  T.  S.; 
Fredericks,  S.  Y.;  Jordan,  K.  D.  Science  1997,  276,  1678. 

(34)  Ugaldc,  J.  M.;  Alkorta,  I.;  Elgucro,  J.  Angew.  Chem.,  Int.  Ed.  2000, 
39,  717. 

(35)  Dunn,  M.  E.;  Pokon,  E.  K.;  Shields,  G.  C.  Int.  J.  Quantum  Chem. 

2004,  100,  1065. 

(36)  Day,  M.  B.;  Kirschner,  K.  N.;  Shields,  G.  C.  Int.  J.  Quantum  Chem. 

2005,  102 ,  565. 

(37)  Xanthcas,  S.  S.  NATO  ASI  Series,  Ser.  C:  Math.  Phvs.  Sci.  2000, 
561,  1 19. 

(38)  Kcutsch,  F.;  Saykally,  R.  J.  Proc.  Natl.  Acad.  Sci.  U.S.A.  2001, 
98,  10533. 

(39)  Xanthcas,  S.  S.;  Burnham,  C.  J.;  Harrison,  R.  J.  J.  Chem.  Phys. 

2002,  116,  1493. 

(40)  Ohno,  K.;  Okimura.  M.;  Akai,  N.;  Katsumoto,  Y.  PCCP  Phys. 
Chem.  Chem.  Phys .  2005,  7,  3005. 

(41)  Heard,  D.  E.;  Pilling,  M.  J.  Chem.  Rev.  2003.  103,  5163. 

(42)  Milligan,  D.  E.;  Jacox,  M.  E.  J.  Chem.  Phys.  1963,  38,  2627. 

(43)  Yamada,  C.;  Endo,  Y.;  Hirota,  E.  J.  Chem.  Phys.  1983.  78,  4379. 

(44)  Thompson,  W.  E.;  Jacox,  M.  E.  J.  Chem.  Phys.  1989,  91,  3826. 

(45)  Dunn,  M.  E.;  Evans,  T.  M.;  Kirschner,  K.  N.;  Shields,  G.  C.  J. 
Phys.  Chem.  A  2005,  110,  303. 

(46)  Fraley,  P.  E.;  Narahari-Rao,  K.  J.  Mol.  Spectrosc.  1969,  29,  348. 

(47)  Engdahl,  A.;  Nelandcr,  B. ,/.  Mol.  Struct.  1989,  193,  101. 

(48)  Jacox,  M.  E.  J.  Phvs.  Chem.  Ref.  Data  1994,  1. 

(49)  Nelandcr,  B.  J.  Phys.  Chem.  A  1997,  101,  9092. 


